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PEpAxrns c virus ffscv) yoLYppTODBs. 

10 Technical Field 

The invention relates to materials and methodologies for managing 
the spread of hepatitis C vims (HCV) infection. More specifically, it relates to 
polypeptides useful as immunological reagents in the detection, prevention and 
treatment of HCV infections. 

15 

HCV was first identified and characterized as a cause of non-A, 
non-B hepatitis (NANBH) by Houghton et al. Tim led to the disclosure of a 
number of general and specific polypeptides useful as immunological reagents. 

20 See, e.g., Houghton et al., EPO Pub. No. 318,216; Houghton et al., EPO Pub. 
No. 388,232; Choo et al., Science (1989) 244:359-362; Kuo et al., Science (1989) 
244:362-364; Houghton et al. Hepatology H99D 14:381*388. These 
publications provide the ait with an extensive background on HCV generally, as 
well as the manufacture and uses of HCV polypeptide immunological reagents. 

25 For brevity, therefore, the disclosure of these publications in particular are 
incorporated herein by reference. 

Others have readffly applied and extended the work of Houghton et 
al. See, e.g., ffighfidd et al., UK Pat. App. 2,239,245 (The Wellcome 
Foundation Ltd.); Wang, EPO Pub. No. 442,394 (United Biomedical Inc.); Leung 

30 et al., EPO Pub. No. 445,423 (Abbott Laboratories); Habits et aL, EPO Pub. No. 
451,891 (Akzo N.V.); Reyes et al., PCT Pub. No. WO 91/15516 (Genelabs Inc.); 
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Maki et aL, EPO Pub. No. 468,657 (Tonen Corp.); and Kamada et aL, EPO Pub. 
No. 469,348 (Shionogi Seiyaku KX). 

Sensitive, specific methods for screening and identifying carriers of 
HCV and HCV-contaminated hlood ox blood products are an important advance in 
5 me<ficine. Posttransfusion hepatitis (PTH) occurs in approximately 10% of 
transfused patients, and HCV has accounted for up to 90% of these cases. The 
major problem in this disease is the frequent progression to chronic liver damage 
(25-55%). Patient caie as well as the prevention of transmission of HCV by 
Wood and blood products or by close personal contact require reliable diagnostic 

10 and prognostic tools, such as HCV polypeptides, to detect antibodies related to 
HCV. Such polypeptides are also useful as vaccines and immunotherapeutic 
therapeutic agents for the prevention and/or treatment of the disease. 

Since HCV is a relatively new agent, a continuing need exists to 
define nt Wfrwnal immunological reagents that will allow further study of the 

15 clinical course of disease and the epidemiology of HCV in the population. 

Disclosure of the Invention 

The invention pertains to the characterization of new HCV epitopes. 
The characterization of these epitopes permits the manufacture of polypeptide 

20 products which reacted immunologically with antibodies to HCV and/or generate 
anti-HCV antibody production in vivo. These polypeptide products are useful as 
standards or reagents in diagnostic tests and/or as components of vaccines. Anti- 
bodies, including for example both polyclonal and monoclonal, directed against 
HCV epitopes contained within these polypeptide sequences are also useful 

25 reagents, for example, in diagnostic tests, as therapeutic agents, for screening of 
antiviral agents, and for the isolation of isolation/purification of HCV polypeptides 
or particles. 

la its broadest sense, die present invention is directed to 
polypeptides containing the newly characterized HCV epitopes disclosed herein, 
30 methods of manufacturing such polypeptides (e.g., recombinant and synthetic 
methods), methods of using such polypeptides (e.g., diagnostic, vaccine, and 
therapeutic), and articles of manufacture, compositions or formulations adapted to 
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such uses (e.g., polypeptides fixed to an immunoassay or other support, oral or 
injectable pharmaceutical compositions). Similarly, antibodies (polyclonal, 
monoclonal, or equivalents such as binding fragments, single-chain antigen-binding 
proteins, etc.) to the HCV epitopes disclosed herein are also included within the 
5 scope of the present invention, as well as methods of making such antibodies, 
methods of using such antibodies (e.g., diagnostic, vaccine, and therapeutic), and 
articles of manufacture, compositions or formulations adapted to such uses (e.g., 
antibodies fixed to an immunoassay or other support, oral or injectable 
pharmaceutical compositions)* 

10 Otter aspects of the invention pertain to kits for analyzing samples 

for the presence of an HCV antigen comprising the above antibodies in a suitable 
container. Still other aspects of the invention pertain to kits for analyzing samples 
for the presence of an antibodies directed against an HCV antigen comprising a 
polypeptide as described above m a suitable container. 

15 Still other aspects of the invention axe: a method for producing a 

polypeptide containing the newly disclosed HCV epitopes comprising incubating 
host cells transformed with an expression vector containing a sequence encoding a 
polypeptide containing the HCV epitope under conditions which allow expression 
of said polypeptide; and a polypeptide containing such an HCV epitope produced 

20 by tins method. 

Immunoassays are also included in the invention. These include an 
immunoassay for detecting an HCV antigen comprising incubating a sample 
suspected of containing an HCV antigen with an antibody as described above 
under conditions which allow the formation of an antigen-antibody complex; and 

25 detecting an antigen-antibody complex containing the antibody* An immunoassay 
for detecting anti-HCV antibodies comprising incubating a sample suspected of 
containing anti-HCV antibodies with a polypeptide as described above, under 
conditions which allow the formation of an antibody-antigen complex; and 
detecting the antibody-antigen complex containing die polypeptide. 

30 Also included in the invention are vaccines for treatment of HCV 

infection comprising an immunogenic peptide containing an HCV epitope 
described herein. 
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Yet another aspect of the invention is a method for producing 
antibodies to HCV comprising administering to an individual a n isolated 
immunogenic polypeptide containing an HCV epitope described herein in an 
amonnt sufficient to produce an immune response. 
> The above aspects of the present invention are accomplished by the 

discovery of HCV epitopes of the formula 




wherein aa denotes an amino acid; 
x and y are integers ^fidftnat y-x & 
10 avaa, indicates a portion of the amino acid sequence of Figure 1; 

and 

x is selected from the group consisting of 23-34, 36, 66-79, 81-94, 
96-98, 101-103/^3 ^9^91^ 06. 223, 232, 256, 286, 297-299, 321, 347, 357, 
413, 414, 432, 465-471, 480484, 501, 502, 521, 540-54S, 579, 594-599, 601- 

15 613, 641, 662-665, 685, 705, 706, 729, 782-789, 801, 351-855, 893, 916, 928, 
946, 952-954, 1026, 1072, 1109, 1112-1117, 1218, 1240 ; 1280-1285, 1322, 1338, 
1371, 1384, 1410, 1411, 1454, 1492, 1493, 1532-1535, 1560, 1561, 1566-1568, 
1571-1577, 1601-1607, 1615-1620, 1655, 1695, 1710-1712, 1728, 1729, 1758- 
1762, 1781, 1808, 1821, 1851, 1880, 1908-1913, 1925, 1940-1948, 1951, 1966- 

20 1969, 1999, 2001-2004, 2006-2014, 2024, 2048-2053, 2055-2057, 2071, 2088- 
2093, 2108, 2122-2148, 2165, 2187, 2226-2232, 2244-2249, 2267, 2281-2286, 
2288, 2289, 2325-2327, 2346, 2347, 2349, 2382, 2401, 2417-2422, 2439-2444, 
2446-2456, 2469, 2471-2476, 2495, 2533, 2534, 2573-2578, 2602-2604, 2606- 
2612, 2632-2638, 2660, 2676-2679, 2688-2693, 2707, 2721, 2757-2762, 2779, 

25 2794, 2795, 2797-2799, 2801, 2802, 2817-2843, 2863-2867, 2878-2884, 2886- 
2895. 
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The above objects are also achieved using HCV epitopes of the 

fbnzmla 

aa x -aa y 

wherein aa denotes an amtnn a cffi 
5 x and y arc integers such that y-x £ 6; 

avaay indicates a portion of the amino add sequence of Figure 1; 
and x is selected from the group consisting of 35 (where y is less than 

45), 80 (where y is less than 90). 95 (where y is less than 110), 99 (where y is 
less than 120) ^00 (where y is less than 150)^ 190 (where y is less than 210), 500 

10 (where y is less than 550), 600 (where y is less than 625), 1260 (where y is less 
than 1280), 1569 (where y is less than 1931), 1570 (where y is less than 1590), 
1694 (where y is less than 1735), 1949 (wine y is less than 2124), 1950 (where y 
is less than 1985), 2000 (where y is less than 2050), 2005 (where y is less than 
2025), 2054 (where y is less than 2223), 2250 (where y is less than 2330), 2287 

15 (where y is less than 2385), 2290 (where y is less than 2310), 2345 (where y is 
less than 2375), 2348 (where y is less than 2464), 2445 (where y is less than 
2475), 2470 (where y is less than 2490), 2605 (where y is less than 2620), 2780 
(where y is less than 2830), 2796 (where y is less than 2886), 2800 (where y is 
less than 2850), and 2885 (where y is less than 2905). 

20 In either of the above formula, x-y can less than or equal to 10, 20, 

30, 40 or 50 in some embodiments of the invention. 

Brief Description of the Drawings 

Fig. 1 shows the polyprotein of the HCV prototype isolate HCV1. 
25 Fig. 2 shows a composite cDNA sequence for HCV1. 

Fig. 3 shows the nucleotide consensus sequence of human isolate 
23, variant sequences are shown below the sequence line. The amino acids 
encoded in the consensus sequence are also shown. 

Fig. 4 shows the nucleotide consensus sequence of human isolate 
30 27, variant sequences are shown below the sequence line. The amino acids 
encoded in the consensus sequence are also shown. 
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Kg. S shows the aligned nucleotide sequences of human isolates 23 
and 27 and of HCVL Homologous sequences are indicated by the symbol (*). 
Non-homologous sequences aie in small letters. 

Fig. 6 shows the aligned amino acid sequences of human isolates 23 
5 and 27 and of HCVL Homologous sequences are indicated by the symbol (*). 
Non-homologous sequences are in snail letters. 

Fig. 7 shows a comparison of the composite aligned nucleotide 
sequences of isolates Thorn, EC1, HCT #18, and HCVL 

Fig. 8 shows a comparison of the nucleotide sequences of EC10 and 
10 a composite of the HCV1 sequence; the EC10 sequence is on the line above the 
dots, and the HCV1 sequence is on the line below the dots. 

Fig. 9 shows a comparison of the amino acid sequences 117-308 
(relative to HCV1) encoded in the TEnvL" regions of the consensus sequences of 
human isolates HCT #18, JH23, JH 27, Thome, EC1, and of HCVL 
IS Fig. 10 shows a comparison of the amino acid sequences 330-360 

(relative to HCV1) encoded in the "EmrR" regions of the consensus sequences of 
human isolates HCT #18, JH23, JH 27, Thome, EC1, and of HCVL 

Modes for Carry in g Chit the Invention 

20 

Complete citations to publications referred to herein can be found in 
the ''Background" or "Bibliography 0 sections. 
L ppfiflffiops 

"Hepatitis C virus" or °HCV° refers to the art-recognized viral 
25 species of which pathogenic strains cause NANBH, and a tte n u a t ed strains or 

defective interfering particles derived therefrom. See generally, publications cited 
in the section entitled "Background." The HCV genome is comprised of RNA. It 
is known that SNA containing viruses have relatively high rates of spontaneous 
mutation, i.e., reportedly on the order of 10 3 to 1& 4 per incorporated nucleotide 
30 (Fields & Knipe (1986)). Therefore, since heterogeneity and fluidity of genotype 
are inherent in RNA viruses, there are multiple strains/isolates, which may be 
virulent or avirulent, within the HCV species. The propagation, i de nti fi ca tio n, 
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detection, and isolation of the various HCV strains or isolates has been well 
documented in the literature. Moreover, the disclosure herein allows the 
preparation of diagnostics and vaccines for the various strains/isolates, as well as 
compositions and methods that have utility in screening procedures for anti-viral 
5 agents for phannacologic use, such as agents that inhibit replication of HCV* 

Information on several different strains/isolates of HCV is disclosed 
herein, particularly drain or isolate CDC/HCV1 (also called HCV1). Information 
from one strain or isolate, such as a partial genomic or amino add sequence, is 
sufficient to allow those skilled in the ait using standard techniques to isolate new 

10 strains/isolates and to identify whether such new drains/isolates are HCV. For 
example, several different strains/isolates are described below. These strains, 
which were obtained from a number of human sera (and from different 
geographical areas), were isolated utilizing the information from the genomic 
sequence of HCV1. , 

15 The information provided herein is indicative that HCV may be 

distantly related to die flaviviridae. The Flavivirus family contains a large number 
of viruses which are small, enveloped pathogens of man. The morphology and 
composition of Flavivirus particles are known, and are diseased in Brinton 
(1986). Generally, with respect to morphology, Flaviviruses contain a central 

20 nucleocapsid surrounded by a lipid bilayer. Virions are spherical and have a 

diameter of about 40-50 nm. Their cores are about 25-30 nm in diameter. Along 
the outer surface of the virion envelope are projections that are about 5-10 nm 
long with terminal knobs about 2 nm in diameter. Typical examples of the family 
include Yellow Fever virus, West Nile virus, and Dengue Fever virus. Utey 

25 possess positive-stranded SNA genomes (~ 11,000 nucleotides) that are slightly 
larger than that of HCV and encode a polyprotein precursor of about 3500 amino 
acids. Individual viral proteins are cleaved from this precursor polypeptide. 

The genomic structure and the nucleotide sequence of HCV genomic 
RNA has been deduced. The genome appears to be single-stranded RNA con- 

30 taining - 10 9 000 nucleotides. The genome is positive-stranded, and possesses a 
continuous, transitional open reading frame (ORF) that encodes a polyprotein of 
about 3,000 amino acids. In the ORF, the structural protein(s) appear to be 
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encoded in approximately the first quarter of the N-tenninns region, with the 
majority of the polyprotein responsible for nonstructural proteins. When 
compared with all known viral sequences, small but sig ni fi ca nt co-linear 
homologies are observed with the non-structural proteto ^ 
and with the pestiviruses (which are now also considered to be part of the 

Flavivirus family). 

Based upon the putative amino acids encoded in the m tcl e otide 
sequence ofHCVl and other evidence, possible protein domains of the encoded 
HCV poryprotem, as well as the approximate boundaries, are the following: 



Pmarive Domain 



/Ippmnmate Boundary 
(amnio acid nos.l 



15 



E, (virion envelope protein) 




20 



25 



Ej/NSl (envelope?) 
NS2 (unknown function) 
NS3 (protease?) 
NS4 (unknown function) 
NS5 (polymerase) 



384-800 

800-1050 

1050-1650 

1651-2100 

2100-3011(end) 
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These domains axe, however, tentative* For example, the B1-NS2 border is 
probably in the 750-810 region, and NS3-NS4 border is about 1640-1650. There 
is also evidence that the 191 aa version of C is a precursor that is further 
processed (e.g., to about 170 aa in length), and that the NS2, NS4 and NS5 
proteins are each further processed into two mature proteins. 

Different strains, isolates or subtypes of HCV are expected to 
contain variations at the amino acid and nucleic acids compared with HCV1. 
Many isolates are expected to show much (i.e., mote than about 40%) homology 
in the total amino acid sequence compared with HCV1. However, it may also be 
found that there are other less homologous HCV isolates. These would be defined 
as HCV according to various criteria such as, for example, an ORF of 
ap pro xim ately 9,000 nucleotides to approximately 12,000 nucleotides, encoding a 
polyprotein similar in size to that of HCV1, an encoded polyprotein of similar 
hydrophobic and/or antigenic character to that d HCV1, and the presence of co- 
linear peptide sequences that are conserved with HCVL In addition, the genome 
would be a positive-stranded SNA. 

HCV encodes at least one epitope which is immunologically 
identifiable with an epitope in the HCV1 polyprotein. the epitope is unique to 
HCV when compared to previously known Flavivimses. The uniqueness of the 
epitope may be determined by its immunological reactivity with anti-HCV anti- 
bodies and lack of immunological reactivity with antibodies to known Flavivims 
species. Methods for determining immunological reactivity axe known in the art, 
for example, by radioimmunoassay, by ELBA assay, by hemagglutination, and 
several examples of suitable techniques for assays are provided herein. 
Alternatively, a comparison of the sequence of the HCV epitope to previously 
known sequences of members of the Flavivims family can be used to evaluate 
"uniqueness/ 

In addition to the above, the following parameters of nucleic acid 
homology and amino acid homology are applicable, either alone or in 
combination, in identifying a strain/isolate as HCV. Since HCV strains and 
isolates are evolutionarily related, it is expected that the overall homology of the 
genomes at the nucleotide level may be about 10% or greater, probably will be 



about 40% or greater, probably about 60% or greater, and even more probably 
about 80% or greater; and in addition that there wffl be corteqxnulnig contiguous 
sequences of at least about 13 nucleotides. It should be noted that variable and 
hypervariable regions within the HCV genome; therefore, the homology in these 

5 regions is expected to be significantly less than that in the overall genome. The 
correspondence between die putative HCV strain genomic sequence and, for 
example, the CDC/HCV1 cDNA sequence can be determined by techniques known 
in the ait For example, they can be determined by a direct comparison of the 
sequence information of the polynucleotide from the putative HCV, and the HCV 

0 cDNA sequenced) described herein. For example, also, they can be determined 
by hybridization of the polynucleotides under conditions which form stable 
duplexes between homologous regions (for example, those which would be used 
prior to S t digestion), followed by digestion with single stranded specific 
nuclease(s), followed by size determination of the digested fragments* 

5 because of the evolutionary relationship of the drains or isolates of 

HCV, putative HCV strains or isolates are identifiable by their homology at the 
polypeptide level Generally, HCV strains or isolates are expected to be at least 
10% homologous, more than about 40% homologous, probably more than about 
70% homologous, and even more probably more than about o0% homologous, and 

) " some may even be more than about 90% homologous at the polypeptide level. 
The techniques for determining amino acid sequence homology are known in the 
art For example, the amino acid sequence may be determined directly and 
compared to the sequences provided herein. Alternatively the nucleotide sequence 
of the genomic material of the putative HCV may be determined (usually via a 

J cDNA intermediate), the amino acid sequence encoded therein can be determined, 
and the corresponding regions compared. 

As used herein, a polynucleotide "derived from 1 * a designated 
sequence refers to a polynucleotide sequence which is comprised of a sequence of 
approximately at least about 6 nucleotides, preferably at least about 8 nucleotides, 

) more preferably at least about 10-12 nucleotides, and even more preferably at least 
about 15-20 nucleotides corresponding to a region of the designated nucleotide 
sequence. "Corresponding" means homologous to or complementary to the 
designated sequence. Preferably, the sequence of the region from which the 
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polynucleotide is derived is homologous to or complementary to a sequence which 
is unique to an HCV genome. Whether or not a sequence is unique to the HCV 
genome can be determined by techniques known to those of skin in the ait. For 
example, the sequence can be compared to sequences in databanks (as of the 

' 5 ptmrfty rtott>) i e g ("^nfthanlr tn Hrtpnrnn* whfffhftr it i< pigment in the itninfertftri 

host or other organisms. The sequence can also be compared to the known (as of 
the priority date) sequences of other viral agents, including those which are known 
to induce hepatitis, e.g., HAV, HBV, and HDV, and to members of the 
Haviviridae. The correspondence or non-correspondence of the derived sequence 

10 to other sequences can also be determined by hybridization under the appropriate 
stringency conditions. Hybridization techniques for determining the 
complementarity of nucleic acid sequences are known in the ait See, for 
example, Maniatis ex aL (1982). In addition, mismatches of duplex poly- 
nucleotides formed by hybridization can be determined by known techniques, 

IS including for example, digestion with a nuclease such as SI that specifically 
digests single-stranded areas in duplex polynucleotides. Regions from which 
typical DNA sequences may be "derived" include but are not limited to, for 
example, regions encoding specific epitopes, as well as non-transcribed and/or 
naii-ixanslated regions. * 

20 The derived polynucleotide is not necessarily physically derived 

from the nucleotide sequence shown, but may be generated in any manner, 
including for example, chemical synthesis or DNA replication or reverse 
transcription or transcription. In addition, combinations of regions corresponding 
to that of the designated sequence may be modified in ways known in the art to be 

25 consistent with an intended use. 

Similarly, a polypeptide or amino acid sequence "derived from" a 
designated amino acid or nucleic acid sequence refers to a polypeptide having an 
amino acid sequence identical to that of a polypeptide encoded in the sequence, or 
a portion thereof wherein the portion consists of at least 3-5 amino acids, and 

30 more preferably at least 8-10 amino acids, and even more preferably at least 11-15 
amino acids, or which is immunologically identifiable with a polypeptide encoded 
in the sequence. This terminology also includes a polypeptide expressed from a 
designated nucleic acid sequence. 
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A recombinant or derived polypeptide is not necessarily translated 
from a designated nucleic acid sequence; it may be generated in any maimer, 
including for example, chemical synthesis, or expression of a recombinant 
expression system, or isolation from HCV, including mutated HCV. A recombin- 
5 ant or derived polypeptide may incbde one or more analogs of amino acids or 
mwmtmfli amino acids in its sequence. Methods of inserting analogs of amino 
adds into a sequence are known in the art It also may include one or more 
labels, which are known to those of skill in the art. 

The term °iecombinant polynucleotide 0 as used herein intends a 
10 polynucleotide of genomic, cDNA, semisynthetic, or synthetic origin which, by 
virtue of its origin or manipulation: (1) is not associated with all or a portion of a 
polynucleotide with which it is associated in nature, (2) is linked to a 
polynucleotide other than that to which it is linked in nature, or (3) does not occur 
in nature. 

15! The fenn "polynudeotide" as used herein refers to a polymeric form 

of nudeotides of any length, either ribonucleotides or deoxyribonucleotides. This 
term refers only to the primary structure of the molecule. Urns, this term 
includes double- and single-stranded DNA and RNA. ft also includes known types 
of modifications, for example, labels which are known in the art, methylatbn, 

20 "caps", substitution of one or more of the naturally occurring nucleotides with an 
analog, intemudeotide modifications such as, for example, those with uncharged 
KnVagtts (e.g., methyl phosphorates, phosphotriesters, phosphoramidates, 
carbamates, etc) and with charged linkages (&£., phosphonrthioates, phosphoro- 
dhhioates, etc.), those containing pendant moieties, such as, for example proteins 

25 (including for e.g. , nucleases, toxins, antibodies, signal peptides, poly-L-lysine, 
etc), those with intercalates (eg., acridine, psoralen, etc), those containing 
chelators {eg., metals, radioactive metals, boron, oxidative metals, etc.), those 
containing alkylators, those with modified linkages (e.g., alpha anomeric nudeic 
adds, etc), as well as unmodified forms of the polynudeetide. 

30 A "purified" polypeptide refers to the polypeptide being in a state 

that is substantially free of other polypeptides, i.e., in a composition that contains 

of about 50% by wright (desired polypeptide/total polypeptide in 
composition), preferably a minimum of about 70%, and even more preferably a 
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minimum of about 90% of the desired polypeptide, without regard to non* 
piuteinaceous materials in the composition. Techniques for purifying viral poly- 
peptides are known in the ait Purified antibodies are similarly defined 

-Recombinant host ceDs", "host cells", "cells", "cell lines", "cell 
5 cultures 9 9 and other such terms denoting microorganisms or higher eukaryotic cell 
lines cultured as unicellular entities refer to cells which can be, or have been, used 
as recipients for recombinant vector or other transfer DNA, and include the 
progeny of the original cell which has been transfected. It is understood that the 
progeny of a single parental cell may not necessarily be completely identical in 

10 morphology or in genomic or total DNA complement as the original parent, due to 
natural, accidental, or deliberate mutation. 

A "repliant* is any genetic dement, e.g. , a plasmid, a 
chromosome, a virus, a cosmid, etc. that behaves as an autonomous unit of 
polynucleotide replication within a cell; Le., capable of replication under its own 

IS control. 

A "vector* is a replicon in which another polynucleotide segment is 
attached, so as to bring about the replication and/or expression of the attached 
segment 

"Control sequence' 1 refers to polynucleotide sequences which are 
20 necessary to effect the expression of coding sequences to which they are ligated. 
Hie nature of such control sequences differs depending upon the host organism; in 
prokaryotes, such control sequences generally include promoter, ribosomal binding 
site, and terminators; in eufcaxyotes, generally, such control sequences include 
promoters, terminators and, in some instances, enhancers* The term "control 
25 sequences* is intended to include, at a minimum, all co mp onents whose presence 
is necessary for expression, and may also include additional components whose 
presence is advantageous, for example, leader sequences. 

"Opexably linked" refers to a juxtaposition wherein the components 
so described are in a relationship permitting them to function in their intended 
30 manner. A control sequence "operably linked" to a coding sequence is ligated in 
such a way that expression of the coding sequence is achieved under conditions 
compatible with the control sequences. 
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An "open reading fame 0 (ORF) is a region of a polynucleotide 
sequence which encodes a polypeptide; this region may present a portion of a 
coding sequence or a total coding sequence. 

A "coding sequence 0 is a polynucleotide sequence which is 
transcribed into mRNA and/or translated into a polypeptide when placed under the 
control of appropriate regulatory sequences. The boundaries of the coding 
sequence are determined by a translation start codon at the 5'-termiims and a 
translation stop codon at the 3'-terminus. A coding sequence can include, but is 
not limited to mRNA, cDNA, and recombinant polynucleotide sequences. 

•Immunologically identifiable with/as" refers to the presence of 
qntqpe(s) and polypeptides© which are also present in the designated 
polypeptide©, usually HCV proteins. Immunological identity may be determined 
by antibody binding and/or competition in binding; these techniques are known to 
those of average skill in the art 
15 As used herein, "epitope 0 refers tc an antigenic dete rminant of a 

polypeptide. An epitope could comprise 3 or more amino acids that defines the 
binding site of an antibody . Generally an epitops consists of at least 5 amino 
acids, and sometimes consists of at least 8 amino acids. Methods of epitope 
mapping are known in the art. 
20 A polypeptide is "immunologically reactive" with an antibody when 

it binds to an antibody due to antibody recognition of a specific epitope contained 
within the polypeptide. Immunological reactivity may be determined by antibody 
binding, more particularly by the kinetics of antibody binding, and/or by competi- 
tion in binding using as competitors) a known polypeptide^) containing an epitope 
25 against which the antibody is directed, lie techniques for determining whether a 
polypeptide is immunologically reactive with an antibody are known in the art 

As used herein, the term "antibody" refers to a polypeptide or group 
of polypeptides which are comprised of at least one antibody combining she. An 
"antibody combining site" or "binding domain" is formed from the folding of vari- 
30 able domains of an antibody molecule(s) to form three-dimensional binding spaces 
with an internal surface shape and charge distribution complementary to the 
features of an epitope of an antigen, which allows an immunological reaction with 
the antigen. An antibody combining site may be formed from a heavy and/or a 



light chain domain (V H and V L , respectively), which farm hypavariahle loops 
which contribute to antigen binding. The term "antibody 0 includes, for example, 
vertebrate antibodies, hybrid antibodies, chimeric antibodies, altered antibodies, 
univalent antibodies, the Fab proteins, and single domain antibodies. 

As used herein, a "single domain antibody" (dAb) is an antibody 
which is comprised of an VH domain, which reacts immunologically with a 
designated antigen. A dAb does not contain a V L domain, but may contain other 
antigen binding domains known to exist in antibodies, for example, the kappa and 
lambda domains. Methods for preparing dAbs are known in the ait. See, for 
example, Ward et al (1989). 

Antibodies may also be comprised of V H and V t domains, as well as 
otter known antigen binding domains. Examples of these types of antibodies and 
methods for their preparation are known in the ait (see, e.£., U.S. Patent No. 
4,816,467, which is incorporated herein by reference), and include the following. 
F6r example, "vertebrate antibodies'' refers to antibodies which are tetramers or 
aggregates thereof, comprising light and heavy chains which are usually 
aggregated in a *Y" configuration and which may or may net have covalent link- 
ages between the chains. In vertebrate antibodies, the amino acid sequences of all 
the chains of a particular antibody are homologous with the Chains found in one 
antibody produced by the lymphocyte which produces that antibody in situ, or in 
vitro (for example, in hyhridomas). Vertebrate antibodies typically include native 
antibodies, for example, purified polyclonal antibodies and monoclonal antibodies. 
Examples of the methods for the preparation of these antibodies are described 
infra. 

"Hybrid antibodies" are antibodies wherein one pair of heavy and 
light chains is homologous to those in a first antibody, while the other pair of 
heavy ami light chains is homologous to those in a different second antibody. 
Typically, each of these two pairs will bind different epitopes, particularly on 
different antigens. This results in the property of "divaience", i.e., the ability to 
bind two antigens simultaneously. Such hybrids may also be formed using 
chimeric chains, as set forth below. 

"Chimeric antibodies", are antibodies in which the heavy and/or 
light chains are fusion proteins. Typically the constant domain of the chains is 
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from one particular species and/or class, and the variable domains axe from a 
different species and/or class. Also included is any antibody in which either or 
both of the heavy or light chains axe composed of combinations of sequences 
mimicking the sequences in antibodies of different sources, whether these sources 
5 be differing classes, or different species of origin, and whether or not die fusion 
point is at the variable/constant boundary. Thus, it is possible to produce 
antibodies in which neither die constant nor the variable region mimic known anti- 
body sequences. It then becomes possible, for example, to construct antibodies 
whose variable region has a higher specific affinity for a particular antigen, or 

10 whose constant region can elicit pnhqmwf complement fixation, or to make other 
improvements in properties possessed by a particular constant region. 

Another example is "altered antibodies", which refers to antibodies 
in which the naturally occurring amino add sequence in a vertebrate antibody has 
besn varied. Utilizing recombinant DNA techniques, antibodies can be redesigned 

15 to obtain desired characteristics. The possible variations are many, and range 
from the changing of one or more amino acids to the complete redesign of a 
region, for example, the constant region. Changes in the constant region, m 
general, to attain desired cellular process characteristics, eg., changes in com- 
plement fixation, interaction with membranes, and other effector functions. 

20 Changes in the variable region may be made to alter antigen binding 

characteristics. The antibody may also be engineered to aid the specific delivery 
of a molecule or substance to a specific cell or tissue site. The desired alterations 
may be made by known techniques in molecular biology, £.£., recombinant 
techniques, she directed mutagenesis, etc. 

25 Yet another example are "univalent antibodies", which are 

aggregates comprised of a heavy chain/light chain dimer bound to the Fc (i.e., 
constant) region of a second heavy chain. This type of antibody escapes antigenic 
modulation. See, e.g., Glennie era/. (1982). 

Included also within the definition of antibodies axe "Fab" fragments 

30 of antibodies. The "Fab" region refers to those portions of the heavy and light 
chains which axe roughly equivalent, or analogous, to the sequences which 
comprise the branch portion of the heavy and light chains, and which have been 
shown to exhibit immunological binding to a specified antigen, but which lack the 
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effector Fc portion. "Fab" includes aggregates of one heavy and one light chain 
(commonly known as Fab*), as well as tetramers containing the ZH and 2L chains 
(referred to as F(ab) 2 ) J which axe capable of selectively reacting with a designated 
antigen or antigen family. "Fab* antibodies may be divided into subsets analogous 
5 to those described above, Le, "vertebrate Fab", "hybrid Fab 0 , 'chimeric Fab°, 
and "altered Fab°. Methods of producing "Fab* fragments of antibodies are 
known, within the ait and inclnde 9 for example, proteolysis, and synthesis by 
recombinant techniques. 

Also included in the tenn "antibodies" are single-chain antigen- 
10 binding (SCA) proteins, such as the type described in die article co-authored by 
Seldom, J. in the June 15, 1992 issue of Cancer Research (as well as articles cited 
therein). 

As used herein, the tenn "immunogenic polypeptide" is a 
polypeptide that elicits a cellular and/or humoral immune response, whether alone 

15 or linked to a earner in the presence or absence of an adjuvant. 

The term "polypeptide" refers to a polymer of amino acids and does 
not refer to a specific length of the product; thus, peptides, oligopeptides, and 
proteins are included within the definition of polypeptide. Ibis term also does not 
refer to or exc&de post-expression modifications of the polypeptide, for example, 

20 glycosylaiion, acetylation, phosphorylation and the like. Included within the 
definition are, for example, polypeptides containing one or more analogs of an 
amino acid (including, for example, unnatural amino acids, ere), polypeptides 
with substituted linkages, as well as other modifications known in the art, both 
naturally occurring and non-naturally occurring. 

25 "Transformation", as used herein, refers to the insertion of an 

exogenous polynucleotide into a host cell, irrespective of the method used for the 
insertion, for example, direct uptake, transduction, f-mating or electruporation. 
The exogenous polynucleotide may be maintained as a non-integrated vector, for 
example, a plasmid, or alternatively, may be integrated into the host genome. 

30 Treatment" as used herein refers to prophylaxis and/or therapy. 

An "individual", as used herein, refers to vertebrates, particularly 
members of the mammalian species, and includes but is not limited to animals 
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(eg., dogs, cats, cattle, swine, sheep, goat, rabbits, mice, rats, guinea pigs, etc*), 
and primates, inchi<fip g monkeys, dumps, baboons and humans. 

As used herein, the "sense strand" of a nucleic acid contains the 
sequence that has sequence homology to that of mKNA* The "anti-sense strand 0 
5 contains a sequence which is complementary to that of the "sense strand". 

As used herein, a "positive stranded genome" of a virus is one in 
which the genome, whether KNA or DNA, is single-stranded and which encodes a 
viral polypeptide®- Examples of positive stranded RNA viruses include 
Togavmdae, Coronaviridae, Retroviridae, Picomaviridae, and Caliciviridae. 
10 TnrlnHftrf also, are the Flaviviridae, which were foimeriy classified as Togaviradae. 
See Fields & Kmpe (1986). 

As used herein, "antibody containing body component" refers to a 
component of an individual's body which is a source of the antibodies of interest. 
Antibody ™mt*fmng body components are known in the art, and include but are 
15 not limited to, for example, plas&ia, serum, spinal fluid, lynq>h fluid, the external 
sections of the respiratory , intestinal, and genitourinary tracts, tears, saliva, milk, 
white blood cells, and myelomas 

As used herein, a "biological sample" refers to a sample of tissue or 
fluid isolated from an individual, including but not limited to, for example, 
20 plasma, serum, spinal fluid, lymph fluid, the external sections of the skin, respir- 
atory, intestinal, and genitourinary tracts, tears, saliva, milk, blood cells, tumors, 
organs, and also samples of in vitro cell culture constituents (including but not 
limited to conditioned medium resulting from the growth of cells in cell culture 
medium, putatively virally infected cells, recombinant cells, and cell components). 

25 

H. Description of the Invention 

The practice of the present invention will employ, unless otherwise 
indicated, conventional techniques of molecular biology, microbiology, 
recombinant DNA, and immunology, which are within die skill of the art. Such 
30 techniques are explained folly in the literature. Src e.g., Maniatis, Fitsch & 
Sambrook, "Molecular Cloning; A Laboratory Manual" (1982); "DNA Cloning, 
Volumes I and IT (D.N Glover ed. 1985); "Oligonucleotide Synthesis" (M J. Gait 
ed, 1984); "Nucleic Add Hybridization" (B.D. Haines & S J. Higgins eds. 1984); 



"Transcription and Translation 0 (BJ>. Hames & SJ. Higgins eds. 1984); 
'Animal Cell Culture" (R.L Freshney ed. 1986); Tmmohilized Cells And 
Enzymes" (IRL Press, 1986); B. Peibal, "A Practical Guide To Molecular 
Cloning 0 (1984); the series, "Methods in Enzymology" (Academic Press, Inc.); 
"Gene Transfer Vectors For Mammalian Cefls" (JJL Miller and M.P. Calos eds. 
1987, Cold Spring Harbor Laboratory), Meft Tfrrymni Vol. 154 and Vol. 155 
(Wu and Grossman, and Wu, eds., respectively), Mayer and Walker, eds. (1987), 
"Immunochemical Methods In Cell And Molecular Biology 0 (Academic Press, 
London); Scopes, (1987) "Protein Purification: Principles and Practice", Second 
Edition (Springer-Vedag, N.Y.); and "Handbook of Experimental Immunology", 
Volumes HV (DM. Weir and C. C. Blackwell eds 1986). AH patents, patent 
applications, and publications mentioned herein, both supra and infra, are hereby 
incorporated herein by reference. 

H.A. Truncated HCV Polypeptides 

The useful materials ami processes of die present invention are 
made possible by the identification below of new HCV epitopes. The knowledge 
of these epitopes (or antigenic regions) allows for construction of polypeptides 
containing truncated HCV sequences which ran be used as immunological 
reagents. 

Truncated HCV amino acid sequences encoding at least one viral 
epitope are useful immunological reagents. For example, polypeptides comprising 
such truncated sequences can be used as reagents in an immunoassay. These 
polypeptides also are candidate subunh antigens in compositions for antiserum 
production or vaccines. While these truncated sequences can be produced by 
various known treatments of native viral protein, it is generally preferred to make 
synthetic or recombinant polypeptides comprising an HCV sequence. Polypeptides 
comprising these truncated HCV sequences can be made up entirely of HCV 
sequences (one or more epitopes, either contiguous or noncontiguous), or HCV 
sequences and heterologous sequences in a fusion protein. Useful heterologous 
sequences include sequences that provide for secretion from a recombinant host, 
enhance the immunological reactivity of the HCV epitope(s), or facilitate the 
coupling of the polypeptide to an immunoassay support or a vaccine carrier. See, 



eg., EPO Pub. No. 116,201; U.S. PaL No. 4,722,840; EPO Pub. No. 259,149; 
U.S. PaL No. 4,629,783, the disclosures of which are incorporated herein by 
reference. 

The size of polypeptides comprising the truncated HCV sequences 
5 can vary widely, the "wirninm size being a sequence of sufficient size to provide 
an HCV epitope, while the maximum size is not critical. For convenience, the 
maximum size usually is not substantially greater than that required to provide the 
desired HCV epitopes and function® of the heterologous sequence, if any. 
Typically, the truncated HCV amino add sequence will range from about 5 (or 8) 

10 to about 100 «ntn» acids in length. More typically, however, the HCV sequence 
will be a maximum of about 50 (or 40) amino adds in length, and sometimes a 
maximum of about 20, 25 or 30 amino acids. It is usually desirable to select 
HCV sequences of at least about 8, 10, 12 or 15 amino acids. 

Examples of truncated HCV amino add sequences (octamers) that 

15 are useful as described herein are set forth below in the excsiples. It is to be 

understood that these peptides do not necessarily precisely map one epitope. Non- 
immunogenic portions of the sequence can be defined using conventional 
techniques and deleted from the described sequences. Further, additional trun- 
cated HCV amino add sequences that comprise an epitope or are immunogenic 

20 can be identified as described herein. 

Polypeptide products containing the truncated HCV amino acid 
sequences disclosed below can be prepared as discrete peptides or incorporated 
into a larger polypeptide, and may find use as described herein. In preferred 
applications, truncated sequences from the El and/or E2 domains have applications 

25 in vaccine and therq>eutic products. While generally any of the domains can have 
some diagnostic utility, C, NS3, NS4 and NS5 are particularly preferred, with 
combinations of C epitopes with epitopes from one or more of the NS3, NS4 or 
NS5 domains being particularly preferred. 

30 ILB. Preparation of Polypeptides 

The availability of DNA sequences encoding HCV amino acid 
sequences permits the construction of expression vectors encoding antigenically ac- 
tive regions of the polypeptide (See, e.g., Fig. 2). These antigenically active 



regions may be derived from coat or envelope antigens or from core antigens, or 
from antigens which are non-structural including, for example, polynucleotide 
binding proteins, polynucleotide polymeiase(s), and other viral proteins required 
for the replication and/or assembly of the vims particle. Fragments encoding the 
5 desired polypeptides are derived, for example, from viral cDNA clones using 
conventional restriction digestion or by synthetic methods, and are ligated into 
vectors which may, for example, contain portions of fusion sequences such as 
jS-Galactosidase or superoxide dismutase (SOD), preferably SOD. Methods and 
vectors which are useful for the production of polypeptides which contain fusion 

10 sequences of SOD are described in European Patent Office Publication number 
0196056, published October 1, 1986. Vectors encoding fusion polypeptides of 
SOD and HCV polypeptides, Le. 9 NANHtu, NANB„, and C100-3, which is 
encoded in a composite of HCV cDNAs, are described in Sections IV.B.l, 
TV3.2 9 and IVJB.4, respectively. Any desired portion of the HCV cDNA 

IS containing an open reading frame (or a synthetic version thereof), can bz used to 
express a recombinant polypeptide, such as a mature or fusion protein. 

Alternatively, a polypeptide contain HCV epitopes can be provided 
by chemical synthesis using standard techniques based on the amino acid sequence 
of the figures and the examples. 

20 The DNA encoding the desired polypeptide, whether in fused or 

unfused form, and whether or not containing a signal sequence to permit secretion, 
may be ligated into expression vectors suitable for any convenient host. Both 
eukaryotic and prokaiyotic host systems are presently used in forming recombinant 
polypeptides, and host cell lines is given in EPO Pub. Nos. 318,216. Hie 

25 polypeptide is then isolated from lysed cells or from the culture medium and pur- 
ified to the extent needed for its intended use. Purification may be by techniques 
known in the ait, for example, differential extraction, salt fractionation, 
chromatography on ion exchange resins, affinity chromatography, centrifugation, 
and the like. See, for example, Methods in Enzymology for a variety of methods 

30 for purifying proteins. Such polypeptides can be used as diagnostics, or those 
which give rise to neutralizing antibodies may be formulated into vaccines. Anti- 
bodies raised against these polypeptides can also be used as diagnostics, or for 
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passive immunotherapy, In addition, as discussed below, antibodies to these 
polypeptides are useful for example in isolating and identifying HCV particles* 

The HCV polypeptides may also be isolated from HCV virions and 
truncated (if not already). The virions may be grown in HCV infected cells in 
5 ri ym ft culture, or in an infected hosL 



ILC. Preparation of A ntigenic Polypeptides and Conjugation wftfr Caflfcr 
An antigenic region of a polypeptide is generally relatively 
smalMypicaliy 8 to 10 amino acids or less in length. Fragments of as few as 5 

10 amino acids may characterize an antigenic region. These segments may cor- 
respond to regions of HCV antigen. Accordingly, using the cDNAs of HCV as a 
basis, DNAs encoding short segments of HCV polypeptides can be expressed 
recombinant^ either as fusion proteins, or as isolated polypeptides. In addition, 
short amino acid sequences can be conveniently obtained by chemical synthesis. 

15 in instances wherein the synthesized polypeptide is correctly configured so as to 
provide the correct epitope, but is too small to be immunogenic , the polypeptide 
may be linked to a suitable carrier. 

A number of techniques for obtaining such linkage are known in the 
ait, including die formation of disulfide linkages using N-sucdnimidyl-3-(2- 

20 pyridyltMo)propionate (SPDP) and sucdnimidyl 

4-(N-nudeimido-methy^ (SMCQ obtained from Pierce 

Company, Rockford, Illinois, Of the peptide lads a suHhydryl group, this can be 
provided by addition of a cysteine residue.) These reagents create a disulfide 
linkage between themselves and peptide cysteine residues on one protein and an 

25 amide imfcagft through the epsflon-amino on a lysine, or other free amino group in 
the other* A variety of such disulfide/amide-fomring agents are known. See, for 
example, Tmrmin Rev (1982) 62:185. Other bifunctional coupling agents form a 
thioether rather than a disulfide linkage. Many of these thio-ether-foiming agents 
are commercially available and include reactive esters of 6-maleimidocaproic add, 

30 2-bromoacetic acid, 2-iodoacetic acid, 

4-(N-maleimido-methyl)cydohe«ane-l-carboxylic acid, and the like. The carboxyl 
groups can be activated by combining them with succinimide or l-hydroxyl-2 
mtro-4-sulfonic acid, sodium salt. Additional methods of coupling antigens 



employs the rotavirus/Trinding peptide 0 system described in EPO Pub. No. 
259,149, the disclosure of which is incorporated herein by reference. The 
foreg oi ng list is not meant to be exhaustive, and modifications of the named 
compounds can clearly be used. 
5 Any carrier may be used which does not itself induce the production 

of antibodies harmful to the host. Suitable carriers are typically large, slowly 
metabolized macromolecules such as proteins; polysaccharides, such as latex 
fanctianaKzed SepharoseP, agarose, cellulose, cellulose beads and die like; 
polymeric amino acids, such as polygbtamic acid, polylysine, and the like; amino 

10 add copolymers; and inactive virus particles, see, for example, Section H.D. 
Especially useful protein substrates are serum albumins, keyhole limpet 
hemocyamn, immunoglobulin molecules, tbyroglobulin, ovalbumin, tetanus toxoid, 
and other proteins well known to those skilled in the art. 

In addition to full-length viral proteins, polypeptides comprising 

15 . 

H.D. Preparation of Hybrid Particle Immunopen? ftnitaini ng HCV Epitopes 

The immunogenicity of the epitopes of HCV may also be enhanced 
by pr ep ari ng them in mammalian or yeast systems fused with or assembled with 
particle-forming proteins such as, for example, that associated with hepatitis B 

20 surface antigen. See, e.g., US 4,722,840. Constructs wherein the HCV epitope 
is linked directly to the particle-forming protein coding sequences produce hybrids 
which are immunogenic with respect to the HCV epitope. In addition, all of the 
vectors prepared include epitopes specific to HBV, having various degrees of 
immunogenicity, such as, for example, the pre-S peptide. Thus, particles 

25 constructed from particle forming protein which include HCV sequences are im- 
munogenic with respect to HCV and HBV. 

Hepatitis surface antigen (HBSAg) has been shown to be formed and 
assembled into particles in S. cerevisiae (P. VaJenzuela et at. (1982)), as well as 
in, for example, mammalian cells (P. Valenzuela et al (1984)). The formation of 

30 such particles has been shown to enhance the immunogenicity of the monomer 
subunit. The constructs may also include the immunodominant epitope of HBSAg, 
comprising the 55 amino adds of the presurface (pre-S) region. Neurath et al 
(1984). Constructs of the pre-S-HBSAg particle expressible in yeast are disclosed 



in EPO 174,444, pubIished;Maicfa 19, 1986; hybrids including heterologous viral 
sequences for yeast expression ate disclosed in EPO 175,261, published March 26, 
1966. These constructs may also be expressed in mammalian cells such as 
Chinese hamster ovaiy (CHO) cells using an SV40-dihydrafolate reductase vector 
5 (Michelle* at (1984)). 

In addition, portions of the particle-forming protein coding sequence 
may be replaced with codons encoding an HCV epitope. In this rqjlacement, 
regions which are not required to mediate die aggregation of the units to form 
immunogenic particles in yeast or mammals can be deleted, thus eliminating 
10 adrf fiKnmi mv antigenic sites from competition with the HCV epitope. 

ILR Preparation of Vaccines 

Vaccines may be prepared from one or more immunogenic 
polypeptides derived from HCV. These polypeptides may be expressed in 

15 various host cells {e.g., bacteria, yeast, insect, or mammalian cells), or 
alternatively may be isolated from viral preparations or made synthetically. 
Single- or muM-valenl vaccines against HCV may be comprised of one or more 
epitopes from one or more structural proteins, and/or one or more epitopes finom 
one or more nonstructural proteins. Thew vaccines may be comprised of, for 

20 example, recombinant HCV polypeptides and/or polypeptides isolated from the 
virions. In particular, vaccines are contemplated comprising one or more of the 
following HCV proteins, or subunit antigens derived therefrom: El, E2, C, NS2, 
NS3, NS4 and NS5. Particularly preferred are vaccines comprising El and/or E2, 
or submits thereof. 

25 In addition to the above, it is also possible to prepare live vaccines 

of attrrntatp** microorganisms which express one or more recombinant HCV 
polypeptides. Suitable attenuated microorganisms are known in the art and 
include, for example, viruses (eg., vaccinia virus (see Brown ex al. (1986)), as 
well as bacteria. 

30 The preparation of vaccines which contain an immunogenic 

polypeptide® as active ingredients, is known to one skilled in the art Typically, 
such vaccines are prepared as injectable, either as liquid solutions or suspensions; 
solid farms suitable for solution in, or suspension in, liquid prior to injection may 



also be prepared. The preparation may also be emulsified, or the protein 
encapsulated in liposomes. Hie active immunogenic ingredients are often mixed 
with exeipients which are phannaceuticaDy acceptable and compatible with the 
active ingredient Suitable exeipients ate, for example, water, saline, dextrose, 
5 glycerol, ethanol, or the like and combinations thereof. In addition, if desiied, the 
vaccine may contain minor amounts of auxiliary substances such as wetting or 
emulsifying agents, pH buffering agents, and/or adjuvants which enhance the 
effectiveness of the vaccine. Examples of adjuvants which may be effective 
include but ate not limited to: alnnrinum hydroxide, N-acetyl-muramyl-L-tfareonyl- 

10 D-isoglutamine (thr-MDP), N-ac^l-nor-muiamyl-L-alanytD-isoghitamiK (CGP 
11637, referred to as nor*MDP), N-ac^lmuiamyl-L-alanyl-D-isoglutaminyl-L- 
alanine-2-(l '-2'^iipalmitoyl-sn^ (CGP 
19835A, referred to as MTP-PE), and RIBI, which contains three components 
extracted from bacteria, monophosphoryl lipid A, trehalose dimycolate and cell 

15 wall skeleton (MPL+TDM+CWS) in a 2% squalene/Tweeri* 80 emulsion. The 
effectiveness of an adjuvant may be determined by measuring the amount of 
antibodies directed against an immunogenic polypeptide containing an HCV 
a ntigenic sequence resulting from administration of this polypeptide in vaccines 
which are also comprised of the various adjuvants. 

20 The vaccines axe conventionally administered parenterally, by injec- 

tion, -for example, either subcutaneously or intramuscularly. Additional 
formulations which are suitable for other modes of administration include 
suppositories and, in some cases, oral formulations. For suppositories, traditional 
binders and carriers may include, for example, polyaUcylene glycols or 

25 triglycerides; such suppositories may be formed from mixtures containing the ac- 
tive ingredient in the range of 0.5% to 10%, preferably 1 %-2%. Oral formu- 
lations include such normally employed exeipients as, for example, pharmaceutical 
grades of manmtol, lactose, starch, magnesium steazate, sodium saccharine, 
cellulose, magnesium carbonate, and the like. These compositions take the form 

30 of solutions, suspensions, tablets, pills, capsules, sustained release formulations or 
powders and contain 10%-95% of active ingredient, preferably 25%-70%. 

Hie proteins may be formulated into the vaccine as neutral or salt 
forms. Pharmaceutical^ acceptable salts include the acid addition salts (formed 
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with free amino groups of the peptide) and which are formed with inorganic adds 
such as, far example, hydrochloric or phosphoric acids, or such organic acids such 
as acetic, oxalic, tartaric, maleic, and the like. Salts formed with the free 
carboxyl groups may also be derived from inorganic bases such as, for example, 
5 sodium, potassium, ammonium, calcium, or feme hydroxides, and such organic 
bases as isqprqpylamine, trimethylamine, 2-^ylamino ethanol, histidine, 
procaine, and the like. 

TLB. Dosape and A Hmttrigtrarin n of Vaccines 

10 He vaccines are administered in a manner compatible with the 

dosage formulation, and in such amount as will be prophylactically and/or 
therapeutically effective. The quantity to be administered, which is generally in 
the range of 5 fig to 250 fig of antigen per dose, depends on the subject to be 
treated, capacity of the subject's immune system to synthesize antibodies, and the 

15 ^degree of protection desired. Precise amounts of active ingredient required to be 
administered may depend on the judgment of the practitioner and may be peculiar 
to each subject 

The vaccine may be given in a single dose schedule, or preferably 
hi a multiple dose schedule. A multiple dose schedule is one in winch a primary 

20 course of vaccination may be with 1-10 separate doses, followed by other doses 
given at sphwpgnt. time intervals required to maintain and or reenforce the 
immune response, for example, at 1-4 months for a second dose, and if needed, a 
subsequent dose(s) after several months. The dosage regimen will also, at least in 
part, be determined by the need of the individual and be dependent upon the 

25 judgment of the practitioner. 

In addition, the vaccine containing die immunogenic HCV antigen(s) 
may be administered in conjunction with other immunoregulatory agents, for 
example, immnne globulins* 

30 ILG. Pnymtion of Antibodies Ac^ WPy Epitopes 

The immunogenic polypeptides described herein are used to produce 
antibodies, including polyclonal and monoclonal If polyclonal antibodies are 
desired, a selected mammal (e.£. f mouse, rabbit, goat, horse, etc.) is immunized 



with an immunogenic polypeptide bearing an HCV epitope(s). Serum from the 
immunized animal is collected and treated according to known procedures. If 
sennn containing polyclonal antibodies to an HCV epitope contains antibodies to 
other antigens, the polyclonal antibodies can be purified by immunoaffinity 
5 chromatography. Techniques far producing and processing polyclonal antisera are 
known in the ait, see for example, Mayer and Walker (1987).Alternatively, 
polyclonal antibodies may be isolated from a mammal which has been previously 
infected with HCV. 

Monoclonal antibodies diiected against HCV epitopes can also be 

10 readily produced by one skilled in the ait The general methodology for making 
monoclonal antibodies by hybridomas is well known. Immoital antibody- 
producing cell lines can be created by cell fusion, and also by otter techniques 
such as direct transformation of B lymphocytes with oncogenic DNA, or 
transfection with Epstein-Ban virus. See, e.g., M. Schreier ex cL (1980); 

15 Hammeding ef aL (1981); Kennett et aL (1980); gffi also, U.S. Patent Nos. 

4,341,761; 4,399,121; 4,427,783; 4,444,887; 4,466,917; 4,472,500; 4,491,632; 
and 4,493,890. Panels of monoclonal antSxxlies produced against HCV epitopes 
can be screened for various properties; i.e., for isotype, epitope affinity, ere. 

Andbodies^ett^rotJonal and polyc^^^vhich are diiected 

20 against HCV epitopes are particularly usefol irra^noSsTand those which are 
neutralizing are useful in passive- immunotherapy. Monoclonal antibodies, in • 
particular, may be used to raise antiidiotype antibodies. 

Anti-idiotype antibodies are immunoglobulins which cany an 
"internal image 0 of the antigen of the infectious agent against which protection is 

25 desired. See, for example, Nisonoff, A. 9 etaL (1981) and Dreesman et al. 

(1985). Techniques for raising antiidiotype antibodies aie known in the ait. See, 
for example, Grzych (1985), MacNamara et aL (1984), and Vytdehaag et aL 
(1985). These anti-idiotype antibodies may also be useful for treatment, 
vaccination and/or diagnosis of NANBH, as well as for an elucidation of the 

30 immunogenic legions of HCV antigens. 
H.H. Immunoassay and Diagnostic Kits 

Both the polypeptides and the antibodies of the present invention are 
useful in immunoassays to detect presence of HCV antibodies, or the presence of 



the virus and/or HCV polypeptides (or epitopes), in, for example, biological 
samples. Design of tbe immunoassays is subject to a great deal of variation, and 
many formats are known in the art The immunoassay will utilize at least one 
viral epitope derived from HCV. In one embodiment, the immunoassay uses a 
5 combination of viral epitopes derived from HCV. These epitopes may be derived 
from die same or from different viial polypeptides, and may be in separate 
recombinant or natural polypeptides, or together in the same recombinant polypep- 
tides. An immunoassay may use, for example, a monoclonal antibody directed 
towards a viral qritope(s), a combination of monoclonal antibodies directed 
10 towards epitopes of one viral antigen, monoclonal antibodies directed towards 
epitopes of different viral antigens, polyclonal antibodies directed towards the 
same viral antigen, or polyclonal antibodies directed towards different viral 
antigens. Protocols may be based, for example, upon competition, or direct re- 
action, or sandwich type assays. Protocols may also, for example, use solid sup- 
15 I ports, or may be by immuroprecipitation. Most assays involved the use of labeled 
antibody or polypeptide; the labels may be, for example, enzymatic, fluorescent, 
chemilnminescent, radioactive, or dye molecules. Assays which amplify die 
signals from die probe are also known; examples of which are assays which utilize 
biotin and avidin, and enzyme-labeled and mediated immunoas&ys, such as 
20 ELBA assays (described below). 

Typically, an immunoassay for an anti-HCV antibody(s) will involve 
selecting and preparing the test sample suspected of containing the antibodies, such 
as a biological sample, then incubating it with an antigenic (i.e., epitope- 
containing) HCV polypeptide^) under conditions that allow antigen-antibody 
25 complexes to form, and then detecting the formation of such complexes. Suitable 
incubation conditions are well known in the art The immunoassay may be, 
without limitations, in a heterogenous or in a homogeneous format, and of a 
standard or competitive type. 

In a heterogeneous format, the polypeptide is typically bound to a 
30 solid support to facilitate separation of the sample from the polypeptide after 

incubation. Examples of solid supports that can be used are nitrocellulose (e.£., in 
mCTfrffre or microtiter well form), polyvinyl chloride (e.g., in sheets or 
nricrotiter wells), polystyrene latex (e.g., in beads or microtiter plates, 



line fhionde (known as Immulorf*), diazotizedpar 



pdyvinylidine fluoride (known as Immulon*), diazotizedpaper, nylon membranes, 
activated beads, and Protein A beads. For example, Dynatech Immulon® 1 or 
Immulon® 2 microtiter plates or 0.25 inch polystyrene beads (Precision Plastic 
Ball) can be used in tbe heterogeneous format. The solid support containing the 
5 antigenic polypeptide is typically washed after separating it from the test sample, 
andprior to detection of bound antibodies. Berth standard and co mpe t i t i ve formats 
axe known in the art 

In a homogeneous format, the test sample is incubated with antigen 
in solution. For example, it may be under conditions that will precipitate any 
10 antigen-antibody complexes which are formed Both standard and competitive 
formats far these assays axe known in the art 

In a standaxd format, the amount of HCV antibodies forming the 
antibody-antigen complex is directly monitored Tins may be accomplished by 
determining whether labeled anti-xenogenic {e.g. 9 anti-human) antibodies which 
15 recogmze an epitope on anti-HCV antibodies will bind due to complex formation. 
In a competitive format, the amount of HCV antibodies in tbe sample is deduced 
by monitoring the competitive effect on the binding of a known amount of labeled 
antibody (or other competing ligand) in the complex. 

Complexes formed comprising anti-HCV antibody (or, in the cate of 
20 competitive assays, the amount of competing antibody) are detected by any of a 
number of known techniques, depending on tbe format For example, unlabeled 
HCV antibodies in the complex may be detected using a conjugate of 
antixenogeneic Ig complexed with a label, (e.g. 9 an enzyme label). 

In immunoassays where HCV polypeptides are the analyte, the test 
25 sample, typically a biological sample, is incubated with anti-HCV antibodies under 
conditions that allow the formation of antigen-antibody complexes. Various 
formats can be employed. For example, a "sandwich assay* may be employed, 
where antibody bound to a solid support is incubated with the test sample; washed; 
incubated with a second, labeled antibody to the analyte, and the support is washed 
30 again. Analyte is detected by determining if the second antibody is bound to the 
support. In a competitive format, which can be either heterogeneous or 
homogeneous, a test sample is usually incubated with antibody and a labeled, 



competing antigen is also incubated, either sequentially or simultaneously. These 
and other formats are well known in the ait 

Efficient detection systems for HCV infection may include the use 
of panels of epitopes, as described above. The epitopes in the panel may be 
5 constructed into one or multiple polypeptides. The assays for the varying epitopes 
may be yvptptiriai or simultaneous. 

The enzyme-linked immunosorbent assay (ELBA) can be used to 
measure either antigen or antibody concentrations. This method depends iqxra 
cxngugation of an enzyme to either an antigen or an antibody , and uses the bound 

10 enzyme activity as a quantitative label. To measure antibody, the known antigen 
is fixed to a solid phase (eg., a microplate or plastic cup), incubated with test 
serum (Stations, washed, incubated with anti-immunogiobulin labeled with an 
enzyme, and washed again. Enzymes suitable for labeling are known in die art, 
and include, far example* horseradish peroxidase. Enzyme activity bound to the 

15 solid phase is measured by adding die specific substrate, and determining product 
formation or substrate utilization colorimetrically. The enzyme activity bound is a 
direct function of die amount of antibody bound. 

To measure antigen, a known specific antibody is fixed to the solid 
phase, the test material containing antigen is added, after an incubation the solid 

20 phase is washed, and a second enzyme-labeled antibody is added. After washing, 
substrate is added, and enzyme activity is estimated colorimetrically, and related to 
antigen concentration. Kits suitable for immunodiagnosis and containing the 
appropriate labeled reagents are constructed by packaging the appropriate 
materials, including the polypeptides of the invention containing HCV epitopes or 

25 antibodies directed against HCV epitopes in suitable containers, along with the 
remaining reagents and materials required for the conduct of the assay, as well as 
a suitable set of assay instructions. 

HI. General Methods 

30 The general techniques used in the practice of the present invention 

can be found in, for example, the references cited herein, particularly EPO Pub. 
Nos. 318,216 and 388,232, as well as the references in die bibliography, which 
are incorporated herein by reference. 




IV. Bxampfe s 

Described below are examples of the present invention which axe 
provided only for illustrative purposes, and not to limit the scope of the present 
invention. In light of the presort disclosure, numerous embodiments within the 
5 scope of die claims win be apparent to those of ordinary skill in the art. 

IV.A. Epitope Mapping of HCV Genome 

The following example is the result of an epitope mapping 
experiment conducted cm the HCV1 polyprotein sequence shown in Fig. L As 

10 shown in Fig. Ml, there are heterogeneities among HCV isolates, footing that 
these amino acid substitutions can be made in the octamera described below. In 
addition to substitutions with amino acids from the corresponding location in other 
HCV isolates, substitutions with synthetic analogs of the particular amino acids or 
conservative substitutions based on charge, etc. (particularly when the substitution 

15 does not destroy antibody binding) arc within the scope of the invention. 

IVJU. Synthesis of Overlapping Peptides 

Polyethylene pins arranged on a block in an 8x12 array (Coselco 
Mimetopes, Victoria, Australia) were prepared by placing the pins in a bath (20% 

20 v/v piperidine in dimethyKbrroamirie (DMF)) for 30 minutes at room temperature. 
The pins were then removed, washed in DMF for 5 min, then washed in methanol 
four times (2 min/ wash). The pins were allowed to air dry for at least 10 min, 
then washed a final time in DMF (5 min). 1-Hydroxybenzotriazole (HOBt, 367 
mg) was dissolved in DMF (80 mL) for use in coupling Fmoc-protected amino 

25 adds: Fmoc-L-Ala-OPip, Fmoc-L-Cys(TrtH)P^), Fmoc-L-AqKOfflu^OPfy, 

Fmoc-L-Glu(0-iBuH)Pfp, Fmoc-L-Fhe-OPfp, Fmoc-Gly-OPfp, Fmoc-L-His(Boc)~ 
OPfp, Fmoc-L-Ile~OPfp, Fmoc-L-Lys(Boc)-OPfp, Fmoc-L-Leu-OPfp, Fmoc-L-Met- 
OPfp, Fmoc-L-Asn-OPfp, Fmoc-L-Pro-OPfp, Fmoc-L^ln-OPfp, Fmoc-L- 
Arg(Mtr)-OPfp, Fm«>L-Ser(/-Bu)-ODhbt, Fmoc-L-Tta(r-Bu)-ODhbt, Fmoc-L-Val- 

30 OPfp, and Fmoc-L-Tyr-OPfp. 

Hie protected amino acids were placed in microtiter plate wells with 
HOBt, and the pin blr-^k placed over the plate, immersing the pins in the wells. 
The assembly was then sealed in a plastic bag and allowed to react at 25°C for 18 



d 



• & 

hours to couple the first amino acids to the pins. The block was then removed, 
and the pins washed with DMF (2 min), MeOH (4 x 2 min), and again with 
DMF (2 min) to clean and depiotect the bound amino acids. The procedure was 
repeated for each ad riftittM amino acid coupled, until all octamers had been pre- 
5 pared. 

The fiee N-tennini weie then acetylated to compensate for the free 
amide, as most of the epitopes are not found at the N-tenninus and thns would not 
have the associated positive charge. Acetylation was accomplished by filling the 
wells of a microliter plate with DMF/acetic anhydride/triethylamine (5:2:1 v/v/v) 
10 and allowing the pins to react in the wells for 90 min at 20°C. The pins were 
then washed with DMF (2 min) and MeOH (4 X 2 min), and air dried for at least 
10 min. 

The side cfa™ protecting groups were removed by treating the pins 
with trifluoroacetic acid/phenoUditMoethane (95:2^:2.5, v/v/v) in polypropylene 
15 bags for 4 hours at room temperature. The pins were then washed in dichloro- 
methane (2x2 min), 5% di-isopropylethylamine/di(±Ioromethane (2x5 min), 
dichloromethane (5 min), and air-dried for at tesst 10 min. The pins were then 
washed in water (2 min), MeOH (18 hours), dried in vacuo, and stored in sealed 
plastic bags over silica geL 

20 

IVJL2. Assay of Peptides 

Octamer-bearing pins prepared as above were first treated by son- 
icating for 30 min in a disruption buffer (156 sodium dodecylsulfate, 0.1% 2-mei^ 
captoethanol, 0.1 M NaHjPO^ at 60°C. The pins were then immersed several 

25 times in water (60°Q, followed by boiling MeOH (2 min), and allowed to air dry. 
The pins were then precoated for 1 hour at 25°C in microliter wells containing 
200 itL blocking buffer (1% ovalbumin, 1% BSA, 0.1% Tween®, and 0.05% 
NaN 3 in PBS), with agitation. The pins were then immersed in microtiter wells 
containing 175 /iL antisera obtained from human patients diagnosed as having 

30 HCV and allowed to incubate at 4°C overnight The pins were assayed against 
antisera from three individual patients. Specimen #PAA 3663-s ("A") exhibited 
strong reaction to HCV Western blots, HCV competitive ELBA, HCV EUSA to 
clone C100-3 (at 1:1000 dilution), and MBA responses of >4+ to C100, 5-1-1, 



.A 



and C33c (CZZnot done). (Antigen/clone names are perEPO Pub. Nos. 318,216 
and 388,232, as well as those described in the literature regarding HCV 
immunoassays available from Ortho Diagnostics Systems, Inc.) Neat plasma was 
diluted 1:500 in blocking buffer. Specimen #PAA 33028 ("B") exhibited strong 
reaction to HCV Weston blots, HCV competitive ELBA, HCV ELISA to clone 
C100-3 (at 1:500 dflmion), and HBA responses of > 4+ to C100, 5-1-1, C33C 
and C22. Polyclonal antisexa was partially purified by passage through a protein 
A column, and was used at a dilution of 1:200 in blocking buffer. S pecunen 
#PAA $32931 fC*) exhibited moderate reaction to HCV Western blots <3+), 
HCV competitive ELISA, HCV ELISA to done C100-3 (at 1:64 dilution), and 
MBA responses of 3+ and 4+ to C100 and 5-1-1, respectively (C33c and C22 
not done). Polyclonal antiseia was partially purified by passage through a protein 
A column, and was used at a dilution of 1:500 in blocking buffer. 

The pins were washed in FBS/Tweerf* 20 (4 x 10 min) at room 
temperature, then incubated in microther wells containing horseradish peroxidase- 
labeled goat anti-Human Ig antiseia (175 jiL, 1:2000 dilution in blocking buffer 
without NaNa) for 1 hour at 25 °C with agitation. The anrih nman antisera is spe- 
cific for human Ig light and heavy chains, and reacts with bcih IgG and IgM 
classes. The pins were again washed in FBS/Tweerf* 20 (4 x 10 min) at room 
temperature. Substrate solution was prepared by diluting NaH 2 P0 4 (1 M, 200 mL) 
and citric acid (1 M, 160 mL) to 2 L with distilled water, adjusting the pH to 4.0. 
Azino^-3-ethyIbazthiazodimuIfonate (ABTS, 50 mg) and hydrogen peroxide 
(0.3 pL/mL) was added to 100 mL of buffer immediately prior to use to complete 
the substrate solution. The substrate solution (150 pL) was added to each well of 
a microther plate, and the pins immersed in the wells and incubated at 25 °C in the 
dark. After color developed, the reactions were halted by removing die pins, and 
absorbance of the solutions read at 405 nm. 

The octamers listed below were immunoreactive with anti-HCV 
antisera. Peptides reacting with all three antisera are listed as epitopes, while pep- 
tides reacting with only one or two antisera are listed as weak epitopes (indicated 
by " ~ "). Particularly strong epitopes are labeled with letters rather than numbers 
(e.g M EpAA). 
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15 IV.B. Ift#tfA siff 

Hie following assay was performed to distinguish early antigens 
from later antigens. Antibodies to die early antigens may be detected, and used to 
diagnose HCV infection more quickly. 

Serial bleeds rvere obtained from a human patient presenting with 

20 elevated ALT, but negative for anti-C100-3 antibody. The five bleeds obtained 
prior to complete seroconversion (C100-3 positive) were pooled and used in the 
assay at a dilution of 1:2000. The assay was conducted as described in Section 
IV.A. above. However, one duplicate set of pins was incubated with horseradish 
peroxidase-labeled goat anti-Human IgG specific antisera, while the other set was 

25 incubated with horseradish peraridase-labeled goat anti-Human IgM specific anti- 
sera. Epitopes immunoreactive with IgM antibodies are early epitopes. 

The results indicated that most early epitopes are found in the 
region extending from about amino acid 480 to about amino acid 650. Particularly 
strong IgM epitopes were octamers beginning with amino acid nos. 506, 510, 523, 

30 553, 562, 580, and the region from 590 to 620. Assays which employ antigens 
bearing epitopes from this region will permit diagnosis of HCV infection at an 
early point than assays employing other antigens. 

We have additionally tested serial plasma specimens taken from five 
patients with open heart post-transfusion NANB hepatitis, with studies followed 

35 for 3*12 years. Initial bleed dates were less than one week spot. Each specimen 
was tested for IgG and IgM by HA against one core antigen (C22) , two envelope 
antigens (El and E2), and three nonstructural region antigens (C33c, C100, and 



NSS). We found that the IgM response to C22 and C33c pieceeded the IgG 
response for those antigens. NS*5 also induced an IgM response, bat this 
response did not proceed the IgG response for that antigen. Thus, one can prepare 
assays capable of ^prmimrig very early stages of infection by utilizing epitopes 
5 derived from the C22 and C33c regions and assaying for IgM binding. Antibodies 
to the C33c region persisted for the longest period of time, suggesting that 
diagnostic assay s directed toward C33c should be the most reliable. 

IV.C. Sequence Va riations m HCV Isolates from Different Individuals 
10 isolates of HCV which contain sequences which deviate from 

CDC7HCV1 were identified in human individuals, some of whom were 
serologically positive for anti-ClOO-3 antibodies (EC10 was antibody negative). 
Identification of these new isolates was accomplished by cloning and sequencing 
segments of the HCV genome which had been amplified by the PGR technique 
IS using CDC/HC1 sequences. The method utilizes primers and probes based upon 
the HCV cDNA sequences described herein. The first step in the method is the 
synthesis of a cDNA to either the HCV genome, or its implicative intermediate, 
using reverse transcriptase. After synthesis of the HCV cDNA, and prior to 
amplification, the SNA in the sample is degraded by techniques known in die art. 
20 A designated segment of the HCV cDNA is then amplified by the use of the 
appropriate primers. The amplified sequences are cloned, and clones containing 
the amplified sequences are detected by a probe which is complementary to a 
sequence lying between the primers, but which does not overlap the primers. 

25 IV.C.1. HCV Isolates Isolated from Humans in „t>».JIA 

Blood samples which were used as a source of HCV virions were 
obtained from the American Bed Cross in Charlotte, North Carolina, and from the 
Community Blood Center of Kansas, Kansas City, Missouri. The samples were 
screened for antibodies to the HCV C100-3 antigen using an FT ISA assay and 

30 subjected to supplemental Western blot analysis using a polyclonal goat anti-human 
HBP to measure anti-HCV antibodies. Two samples, #23 and #27, from the 
American Bed Cross and from the Community Blood Center of Kansas, 
respectively, were determined to be HCV positive by these assays. 



Vital particles present in the serum of these samples were isolated 
by uhraceotrifugation under the conditions described by Bradley et al. (1985). 
KNA was extracted from the particles by digestion with proteinase K and SDS at 
final concentrations of 10 /tg/mL proteinase K, and 0.1 % SDS; digestion was for 
5 1 hour at 37°C. Viral RNA was further purified by extraction with chloroform- 
phenol. 

HCV RNA in the preparation of RNA was reverse transcribed into 
cDNA. After both strands of the cDNA were synthesized, the resulting cDNA 
was then amplified by the PCR method. The HCV cDNAs in three clones derived 
10 from each HCV isolate, were subjected to sequence analysis. Analysis was 
essentially by the method described in Chen and Seebuig (1985). 

Consensus sequences of the clones derived from HCV in samples 23 
and 27 are shown in Fig. 3 and Kg. 4, respectively. Hie variable sequences are 
also shown in these figures, as are the amino acids encoded in die consensus 
15 sequences. 

Figures 5 and 6 show comparisons of die aligned positive strand 
nucleotide sequences (Fig. 5) and putative amino add sequences (Fig. 6) of 
samples 23, 27, and HCV1. The amino acid sequence of HCV1 in Fig. 6 
represents amino add numbers 129-467 of the HCV poiyrootein encoded by the 

20 large ORF in the HCV genomic SNA. An examination of Figs. 5 and 6 show that 
there are variations in the sequences of the three isolated clones. The sequence 
variations at the nucleotide levd and the amino add levd are summarized in the 
table immediately below. In the table, the polypeptides designated S and NS1 
represent amino acid numbers 130 to -380, and 380 to -470, respectivdy, as 

25 those domains were previously known. The numbering is from the putative 

initiator methionine* The terminology S and NS1 is based upon the positioning of 
the sequences encoding the polypeptides using the Flavivinis model. As discussed 
above, however, recent evidence suggests that there is not total correlation 
between HCV and the Flaviviruses with regard to viral polypeptide domains, 

30 particularly in the putative E/NS1 domains. Indeed, HCV polypeptides and their 
coding domains may exhibit substantial deviation from the Flavivinis model. 



Viral particles present in the serum of these samples were isolated 
by ultiacentrifugation tinder the conditions described by Bradley ex al. (1985). 
RNA was extracted from the particles by digestion with proteinase K and SDS at 
final concentrations of 10 ngfmL proteinase K, and 0.1 % SDS; digestion was for 
5 lhourat37°C Viral RNA was Anther purified by extraction with chloroform- 
phenol. 

HCV RNA in the preparation of RNA was reverse transcribed into 
cDNA. After both strands of the cDNA were synthesized, die resulting cDNA 
was then amplified by the PGR method. The HCV cDNAs in three clones derived 
10 from each HCV isolate, were subjected to sequence analysis. Analysis was 
essentially by the method described in Chen and Seeburg (1985). 

Consensus sequences of the clones derived from HCV in samples 23 
and 27 are shown in Fig. 3 and Fig. 4, respectively. The variable sequences are 
also shown in these figures, as are the amino acids encoded in the consensus 
IS sequences. 

Figures 5 and 6 show comparisons of the aligned positive strand 
nucleotide sequences (Fig. 5) and putative amino acid sequences (Fig. S) of 
samples 23, 27, and HCVL The amino acid sequence of HCV1 in Kg. 6 
represents amino acid numbers 129-467 of the HCV polyprotein encoded by the 

20 large ORF in the HCV genomic RNA. An examination of Figs. 5 and 6 show that 
there are variations in the sequences of the three isolated clones. The sequence 
variations at the nucleotide level and the amino add level are summarized in the 
table immediately below. In the table, the polypeptides designated S and NS1 
represent amino acid numbers 130 to -380, and 380 to -470, respectively, as 

25 those domains were previously known. The numbering is from the putative 

initiator methionine. Hie terminology S and NS1 is based upon the positioning of 
the sequences encoding the polypeptides using the Flavivims model. As discussed 
above, however, recent evidence suggests that there is not total correlation 
between HCV and the Flaviviruses with regard to viral polypeptide domains, 

30 particularly in the putative E/NS1 domains. Indeed, HCV polypeptides and then- 
coding domains may exhibit substantial deviation from the Flavivims model. 
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84 



10 Although there are variations in die newly isolated HCV sequences, 

the cloned sequences from samples 23 and 27 (called HCV23 and HCV27) each 
contain 1019 nucleotides, indicating a lack of deletion and addition mutants in this 
region in die selected clones. The sequences in Figs. 5 and 6 also show that the 
isolated sequences are not rearranged in this region. 

15 A comparison of the consensus sequences for HCV1 and for the 

other isolates of HCV is summarized in the Table, supra. The sequence variations 
between the chimpanzee isolate HCV1, and die HCVs isolated from humans are 
about the same as that seen between the HCVs of human origin. 

' it is of interest that the sequence variations in two of the putative 

20 domains is not uniform. Hie sequence in a putative S region appears to be 

relatively constant, and randomly scattered throughout the region. In contrast, a 
putative NS1 region has a higher degree of variability than the overall sequence, 
and die variation appears to be in a hypervariable pocket of about 28 amino acids 
which is located about 70 amino acids downstream from the putative N-tenninus 

25 of the putative polyprotein. 

Although it may be argued that the detected variations woe 
introduced during the amplification process, it is unlikely that all of the variations 
are from this result. It has been estimated that Taq polymerase introduces errors 
into a sequence at approximately one base per 10 kilobases of DNA template per 

30 cycle (SaM er aL (1988)). Based upon this estimate, up to 7 errors may have 
been introduced rinr fo g the PCR amplification of the 1019 bp DNA fragment. 
However, die three subclones of HCV-23 and HCV-27 yielded 29 and 14 base 
variations, respectively. The following suggest that these variations are naturally 



t • 

occurring. About 60% of the base changes axe silent mutations which do not 
change the amino acid sequence. Variations introduced by the Taq polymerase 
during PGR amplification would be expected to occur randomly; however, the 
results show that the variant sequences axe clustered in at least one specific region. 

IV.C.2. HCV Isolates from Humans in Italy and in the XJ ^, 

Segments of HCV RNA present in different isolates were amplified 
by the HCV/cPCR method, Ibese segments span a region of -0.6 Kb to -1.6 
Kb downstream from the methionine encoding start codon of the putative HCV 
polyprotein. The isolates are from biological specimens obtained from HCV 
infected individuals. More specifically, isolate HCT #18 is from human plasma 
from an individual in the U.S.A., EC1 and EC10 are from a liver biopsy of an 
Italian patient, and 111 is from a peripheral blood mononucleocyte fraction of an 
American patient Comparable segments of HCV UNA have been isolated from a 



RNA was extracted from the human plasma specimens using 
phenolrCHCVisoamyl alcohol extraction. Other 0.1 mL or 0.01 mL of plasma 
was diluted to a final volume of 1.0 mL, with a TENB/proteinase K/SDS solution 
(0.05 M Itis-HCL, pH 8.0, 0.001 M EDTA, 0.1 M NaCl, 1 mg/mL Proteinase 
K, and 0.5% SDS) containing 10 to 40 ngfmL polyadenylic acid, and incubated at 
37°C for 60 minutes. After this proteinase K digestion, the resultant plasma 
fractions were deproteinized by extraction with TE (50 raM Tris-HCl, pH 8.0, 1 
mM EDTA) s a turat ed phenol, pH 6.5. The phenol phase was separated by 
centrifugaiion, and was reextracted with TENB containing 0.1% SDS. The 
resulting aqueous phases from each extraction were pooled, and extracted twice 
with an equal volume of phenoy<±lorofbnn/isoamyl alcohol [1:1(99:1)], and then 
twice with an equal volume of a 99:1 mixture of chlorofbnn/isoamyl alcohol 
Following phase separation by ceutrifugation, the aqueous phase was brought to a 
final concentration of 0.2 M Na acetate, and the nucleic acids were precipitated by 
the addition of two volumes of ethanol. The precipitated nucleic acids were 
recovered by ultraceatrifugation in a SW 41 rotor at 38 K, for 60 minutes at 4°C 
or in a microfuge for 10 minutes at 10 K, 4°C. 



SNA extracted from the Jiver biopsy was provided by Dr. F. 
Bonino, Ospedale Maggiore di S. Giovanni Battistn, Torino, Italy. The 
mononncieocyte fraction was obtained by sedimentation of the individual's aliquot 
of blood through FicoB-PaqueP (Phannacia Corp), using the manufacturer's dxr- 
5 ections. Total RNA was extracted from the fraction using the guanidinium thio- 
cyanate pxocedute described in Choo et al (1989). 

Synthesis of HCV cDNA from the samples was accomplished using 
reverse transcriptase. Following ethanoi precipitation, the precipitated KNA or 
nucleic acid fraction was dried, and resuspended in DEPC treated distilled water. 

10 Secondary structures in the nucleic adds were disrupted by heating at 65°C for 10 
miHvteflj and the samples were immediately cooled on ice. cDNA was synthesized 
using 1 to 3 /*gof total RNA from liver, or from nucleic acids (or RNA) extracted 
from 10 to 100 /*L of plasma. The synthesis utilized reverse transcriptase, and 
was in a 25 /xL reaction, using the protocol specified by the manufacturer, BRL. 

15 All reaction mixtures for cDNA synthesis contained 23 units of the RNAase 
inhibitor, Rnasin® (Fisher/Promega). Following cDNA synthesis, the reaction 
mixtures were diluted with water, boiled far 10 minutes, and quickly chilled on 
ice. 

Each set of samples was subjected to two rounds of PGR 
20 amplification. The primers for the reactions were selected to amplify regions 

designated "EnvL" and EnvR". The TEnvL" region encompasses nucleotides 669- 
1243, and putative amino acids 117 to 308; the "EnvR" region encompasses 
nucleotides 1215-1629, and encodes putative amino acids 300-408 (the putative 
amino adds are numbered starting from the putative methionine initiation codon). 
25 The PCR reactions were performed essentially according to the k 

manufacturer's directions (Cetus-Peridn-Ehner), except for the addition of 1 fig of 
RNase A. The reactions were carried out in a final volume of 100 /xL . The PCR 
was performed for 30 cycles, utilizing a regimen of 94°C (1 min), 37°C (2 min), 
and 72°C (3 min), with a 7 minute extension at 72 °C for the last cycle. The 
30 samples were then extracted with phenolzCHCI,, ethanoi precipitated two times, 
resuspended in 10 mM Tris HQ, pH 8.0, and concentrated using Centricon-30 
(Amicon) filtration. This procedure efficiently removes oligonucleotides less than 



30 nucleotides in size; thus, the primers from the! first round of PCR amplification 
aie removed. 

The Centricon-30 concentrated samples were then subjected to a 
second round of PCR amplification. Amplification by PCR was for 35 cycles 
5 utilizing a regimen of 94°C (1 thin), 60°C (1 mm) f and 72 °C (2 min), with a 7 
minute extension at 72 °C for the last cycle. The samples woe then extracted with 
phenol:CHCI 3 , precipitated two times, and digested with EcoRI. The PCR 
reaction products were analyzed by separation of the products by electrophoresis 
on 656 polyacrylamide gels. DNA of approxim ately the estimated size of the 

10 expected PCR product was eiectroehited from the gels, and subcloned into either a 
pGEM-4plasmid vector or into Xgtll. The expected product sizes for the EnvL 
and EnvR after the first round of amplification are 615 bp and 683 bp, 
respectively; after the second round of amplification the expected product sizes for 
EnvL and EnvR are 414 bp and 575 bp, respectively. The plasmids containing the 

15 amplified products were used to transform host cells; the pGEM-4 plasmid was 
used to transform DH5~a^pha, and Xgtll was used to transform C600 delta-HFL 
Clones of the transformed cells which either hybridized to the appropriate HCV 
probes, or those which had inserts of the correct size were selected. The insert, 
were thsn cloned in M13 and sequenced. The probes for all of the HCV/cPCl 

20 products consisted of *P labeled sections of HCV cDNA which had been prepared 
by PCR amplification. 

Sequence information on variants in the EnvL region was obtained 
from 3 clones from HCT #18, 2 clones from TH, 3 clones from EC1, and from 
the HCV1 clones. A comparison of the composite nucleotide sequence of each 

25 isolate derived from these clones is shown in Fig. 7. In the figure, each sequence 
is shown 5' to 3' for the sense strand for the EnvL region, and the sequences have 
been aligned. The vertical lines and capital letters indicate sequence homology, 
die absence of a line and an uncapitalized letter indicates a lack of homology. The 
sequences shown in die lines are as follows: line 1, Thorn; line 2, EC1; line 3, 

30 HCT#18;line4,HCVl. 

Sequence information on variants in the EnvR region was obtained 
from two clones of EC10, and from HCV1 clones. The two EC10 clones differed 
by only one nucleotide. A comparison of the nucleotide sequences of EC10(clone 



2) and a composite of the HCV1 sequences is shown in Fig. 8; each sequence is 
shown 5 ' to 3 ' for the sense strand of die EnvR region, and the sequences have 
been aligned. The double dots between the sequences indicate sequence 
homology. 

5 A comparison of the amino arid sequences encoded in the EnvL 

(amino adds #117-308) and EnvR region (amino adds #300438) for each of the 
isolates is shown in Eg. 9 and Fig. 10, respectively. In cluded in the Figures axe 
sequences for the isolates JH23 and JH27, described above. Also indicated are 
sequences from a Japanese isolate; these se q uences were provided by Dr. T. 

10 Miyamura, Japan. In the figures, the amino arid sequence for the region is given 
in its entirety forHCVl, and the non-homologous amino acids in die various 
isolates axe indicated. 

As seen in Fig. 9, In the EnvL region these is overall about a 93% 
homology between HCV1 and the other isolates. HCT18, Th, and EC1 have 

15 about a 97% homology with HCV1; JH23 and JH27 have about 96% and about 
95% homology, respectively, with HCV1. Fig. 10 shows that the homologies in 
the EnvR region axe significantly less than in the EnvL region; moreover, one 
snbregion appears to be hypervariable (Lc, from amino acid 383-405). TUs data 
is summarized in the Table immediately below. 

20 Table: Homology of EnvR Region 

Isolate Percent Homology with-HCVl 

AA330-AA438 AA383-AA405 

JH23(U.S.) 83 57 

JH27(U.S.) 80 39 

25 Japanese 73 48 

EC10 (Daly) M 48 

VL Industrial Applicability 

The epitopes identified herein can be used to make polypeptide 
30 products as described above for applications such as the screening of blood for 
HCV infection, clinical HCV diagnosis, the generation of antibodies, and 
preparation of medicaments. Other applications are described above, and still 
others win be readily apparent to those of ordinary skin. 
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WHAT IS CLAIMED- 



1. A polypeptide comprising a truncated HCV sequence containing 
an HCV epitope of the formula 



wherein aa denotes an amino acid; 
x and y are integers such that y-x s 6; 

aa,-aa, indicates a portion of the amino acid seqnpnce of Figure 1; and 
x is seJectedJnan the group consisting of 23-34, 36, 66-79, 81-94, 96- 



98, 10W037186-189, 191/206, 223, 232, 256, 286, 297-299, 321, 347, 357, 



413, 414, 432, 465-471, 480-484, 501, 502, 521, 540-549, 579, 594-599, 601- 
613, 641, 662-^65, 685, 705, 706, 729, 782-789, 801, 851-855, 893, 916, 928, 
946, 952-954, 1026, 1072, 1109, 1112-1117, 1218, 1240, 1280-1285, 1322, 1338, 
1371, 1384, 1410, 1411, 1454, 1492, 1493, 1532-1535, 1560, 1561, 1566-1568, 
1571-1577, 1601-1607; 1615-1620, 1655, 1695, 1710-1712, 1728, 1729, 1758- 
1762, 1781, 1808, 1821, 1851, 1880, 1908-1913, 1925, 1940-1948, 1951, 1966- 
1969, 1999, 2001-2004, 2006-2014, 2024, 2048-2053, 2055-2057, 2071, 2088- 
2093, 2108, 2122-2148; 2165, 2187, 2226-2232, 2244-2249, 2267, 2281-2286, 
2288, 2289, 2325-2327, 2346, 2347, 2349, 2382, 2401, 2417-2422, 2439-2444, 
2446-2456, 2469, 2471-2476, 2495, 2533, 2534, 2573-2578, 2602-2604, 2606- 
2612, 2632-2638, 2660, 2676-2679, 2688-2693, 2707, 2721, 2757-2762, 2779, 
2794, 2795, 2797-2799, 2801, 2802, 2817-2843, 2863-2867, 2878-2884, 2886- 
2895. 

2. The polypeptide of claim 1 which is about 100 amino acids or less 

in length. 

' 3. The polypeptide of claim 2 wherein y-x £ 50. 

4. The polypeptide of claim 2 wherein y-x £ 20. 

5. The polypeptide of claim 2 wherein y-x < 10. 
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6. The polypqrtide of claim 2 wherein x is selected from the gronp 
consisting of 506, 510, 523, 553, 562, 580, and 590-620. 

7. A polypeptide of about 100 ammo acids or less comprising an 
5 HCV epitope of the formula 

aa^-aa, 

wherein aa denotes an amino acid; 
x and y are integers such that y-x s 6; 

aa^-aay indicates a portion of the amino acid sequence of Figure 1; and 
10 x is selected from the group consisting of 35 (where y is less than 45), 

80 (where y is less than 90), 95 (where y is less than 110), 99 (where y is less 
man 120), 100 (where y is less man 150), 190 (where y is less than 210), 500 
(where y is less man 550), 600 (where y is less than 625), 1260 (where y is less 
than 1280), 1569 (where y is less than 1931), 1570 (where y is less- man 1590), 
15 1694 (where y is less man 1735), 1949 (where y is less than 2124), 1950 (where y 
is less man 1985), 2000 (where y is less man 2050), 2005 (where y is less than 
2025), 2054 (where y is less man 2223), 2250 (where y is less man 2330), 2287 
(where y is less than 2385), 2290 (where y is less than 2310), 2345 (where y is 
less than 2375), 2348 (where y is less than 2464), 2445 (where y is less than 
20 2475), 2470 (where y is less than 2490), 2605 (where y is less than 2620), 2780 
(where y is less than 2830), 2796 (where y is less than 2886), 2800 (where y is 
less than 2850), and 2885 (where y is less man 2905). 

8. An immunoassay reagent comprising a polypeptide according to 
25 any of claims 1-7. 

9. The immunoassay reagent of claim 8 wherein y-x £ 50. 

10. The immunoassay reagent of claim 8 wherein x is selected from 
30 the group consisting of 506, 510, 523, 553, 562, 580, and 590-620. 



# • 

11. A method for detecting the presence of antibodies immunoieactive 
with Hepatitis C vims (HCV) proteins in a sample, said method comprising: 

contacting an immobilized immunoassay reagent according to claim 8 
with said sample; and 
5 deterring antibodies bound to said reagent 



12. A method for inducing an immunological response in a subject 
against HCV, said method comprising: 

10 administering to said subject an effective amount of a polypeptide 

accoding to any one of claims 1*7. 

13. A composition lor itwfa^iyg an immunological response in a 
subject against HCV, said composition comprising an effective amount of a 

15 polypeptide accoding to any one of claims 1-7. 

14. A monoclonal or polyclonal antibody composition wherein said 
antibodies bind the HCV epitope of a polypeptide acconfing to claim 1. 



20 



IS. A method of making a polypeptide according to any of claims 1*7 
wherein said polypeptide is prepared by recombinant expression or chemical 
Synthesis. 
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FIG. I - 1 



R T 

MSTNPKPQKXNKRNTNRRPODVKPPGGGQIVGGVYLLPRRGPRLGVRATR 
KTSERSQPRGRRQP I PKARRPEGRTWAQPG YPWPLYGNEGCGWAGWLLS P— 1 0 » > 
RGS R P SWG PTDP RRRSRNLGKVTDTLTCGFADLMG YI PLVG APLGG AARA 

T 

LAHGVRVLEDGVNYATGNLPGCSFS IFLLALLSCLTVPASAYQVRNSTGL- 200 

YHVTNDCPNSSIVYEAADAILHTPGCVPCVRBGNASRCWVAMTPTVATRD 

GKLPATQLRRHIDLLVGSATLCSALYVGDLCGSVFLVGQLFTPSPRRHWT-300 

V 

TQGCNCS IYPGHITGHRMAWDMMMNWSPTTALVMAOLLRIPOAILDMIAG 

AHWGVLAGIAYPSMVGNWAKVLVVIIiU'AGVDAETHV^ 

SLLAPGAKQNVQLIHTNGSWHLNSTALNCNDSLNTGWLAGLPYHHKFNSS 

GCPERLASCRPLTDFDQGWGPISYANGSGPDQRPYCWHYPPKPCGIVPAK-500 

SVCGPVYCPTPSPWVGTTDRSGAPTYSWGBNDTDVPVLNNTRPPLGNWF 

GCTWMNSTGFTKVCGAPPC7IGGAGNNTLHCPTDCFRKHPDATYSRCGSG-600 

I 

PWLTPRCLVDYPYRLWHYPCTINYTIFKIRMYVG6VEHRLEAACNWTRGE 
RGDLEDRDRSELSPLLLTTTQWQVLPCSPTTLPALSTGLIHLHQNIVDVQ-700 
YLYGVGSSIASWAIKWEYVVIXPLLIADARVC^CLWMMLLISOAEAALEN 
LVILN AASLAGTHGLVS FLVFFCFAWYLKGKWVPG AVYTFYGMWPLIJLIiL- 800 



SUBSTITUTE SHEET 



2/23 



LAL P ORA YALDTEVAAS CGGWLVGLMALTLS P YYKR Y I SWCLWWLQYFL 
TRVEAQLHVWI P P LNVRGG RDAVILIMCAVHPTLVFDITKLLLAVFG PLW- 900 
IIX)ASLLKVPYFVRVQGI^RFCALARKMIGGHYVQMVIIKLGALTGTYVY 
NHLTP LRDWAHNGLRDIAVAVEPWFSQMETKLITWGADTAACGD I INGL- 1 0 0 0 
PVSARRGREILLGPADGMVSKGWRLIAPITAYAQQTRGLLGCIITSLTGR 
DKNQVEGEVQIVSTAAQTFLATC INGVCWTVYHGAGTRTIASPKG PVIQM-1 1 0 0 

S T 

yTNVDODLVGWPAPCjGSRSLTPCTCGSSDLYLVTRHADVIPVRRRGDSRG 

SLLSPRPISYLKGSSGGPLLCPAGHAVGIFRAAVCTRGVAKAVDFIPVEN-1200 

LETTMRSPVFTDNSSPPWPQSFQVAHLHAPTGSGKSTKVPAAYAAOGYK 

L 

VLVLNPSVAATLGFGAYMSKAHGIDPNIRTGVRTITTGSP.ITYSTYGKFL-1300 



ADGGCSGGAYDI 1 1 CDECHSTDATSILG IGTVLDQAETAGARLWLATAT 
PPGSVTYPHPNIEEVALSTTGBIPFYGKAIPLEVIKGGRHLIFCHSKKKC-1 400 
DEIAAKLVAI/;iNAVAYYRGLDVSYIPTSGDVVVVATDALMTGYTGDFD& 

V (S) 
VIDCNTCVTQTVDFSLDPTFTIETITLPQDAVSRTQRRGRTGRGKPGIYR-1500 
FVAPGERPSGMFDSSVLCECYDAGCAVTYELTPAETTVRLRAYMNTPGLPV 
CQDHLEFWEGVFTGLTHIDAHFLSQTKQSGENLPYLVAYQATVCARAQAP-1600 
PPSWDQMWKCLIRLKPTLHGPTPLLYRLGAVQNEITLTHPVTKYIMTCMS 
ADLEVVTSTWVLVGGVLAALAAYCLSTGCVVTVGRVVLSGKPAIIPDREV-1700 
LYREFDEMEECSQHLPYIEOGMMLAEQFKQKALGLLQTASRQAEVIAPAV 
QTNWQ KLETFWAK HMWNF I SG I QYLAGLS TLPG N PA I ASIMAFTAAVTS P - 1 8 0 0 
LTTS0TLLFNILGGWVAAOLAAPGAATAFV6AGLAGAAIGSVGLGKVLID 



ILAGYGAGVAGALVAFKIMSGBVPSTEDLVNLLPAILSPGALWGWCAA-1 9 00 

(HC) 

I LRRH VGPG EGAVQWMNRLI AFAS RGNHVSPTHYVPESDAAARVTA l£SS 



FIG. I -2 
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LTVTQLLRRLHQW IS S ECTTPCSG SWLRD IWDWICEVLSDFKTWLKAKLM- 2000 

(V) 

POLPGIPFVSCQRGYKGVWRGDGIMHTRCHCGAEITGHVKNGTMRIVGPR 
TCRNMWSGTFP IN AYTTGPCTPLPAPNYTFALWRVSAEEYVEIRQVGDFH- 2100 
WtGMTTDNLKCPCQVPSPEFFTELDGVRLHRFAPPCKPLLREEVSFRVG 
LHEYP VGSQLPCEPEPDVAVLTSMLTDPSHITAEAAGRRLARGSPPSVAS- 2200 
SSASOLSAPSLKATCTANHDSPDAELIEANLLWRQEMGGNITRVESENKV 
VILDSFDPLVAEEDEREISVPAEILRKSRRFAQALPVWARPDYNPPLVET-2300 

(S) 

WKKPDYEPPVVHGCPLPPPKSPPWPPRKKRTVVLTESTLSTALAELATR 

(FA) 

SFGSS STSG ITGDNTTTSSEPAPSGCPPDSDAES YSSMPPLEGEPGDPDL- 2400 
SIX3SWSTVSSEANAEDVVCCSMSYSVn:GALVTPCAAEEQKLPINALSNSL 
LRHHNLVYSTTSRSACQRQKKVTFDRLQVLDSHYQDVLKEVKAAASKVKA- 2500 

(F) 

NLLSVEEACSLTPPHSAKSKFGYGAKDVRGHARKAVTHINSW7KDLLEDN 

VT P I D TTIMAKN EVFCVQP EKGG RKPARL IVFPDLGVRVCEKMAL YDWT- 2600 

KLPLAVMGSSYGFQYSPGQRVEFLVQAWKSKKTPMGFSYDTRCFDSTVTE 

(G> 

SDIRTEEAIYQCCDLDPOARVAIKSLTERLYVGGPLTNSRGENCGYRRCR-2700 
ASGVLTTSCGNTLTCYIKARAACRAAGLQDCTMLVCGDDLWICESAGVQ 
EDAASLRAFTEAMTRYSAPPGDPPQPEYDLELITSCSSNVSVAHDGAGKR-2800 
VYYLTRDPTTPLARAAWETARHTPVNSWLGNIIMFAPTLWARMILMTHFF 
SVLI ARDOLEQALDCEIYG ACYS IEPLDLPP I IQRLHGLSAFSLHSYSPG- 2 900 

G 

EINRVAACLRKI^VPPLRAWRHRARSVRARLLARGGRAAICGKYLFNWAV 

(P) 

RTKLKLTPIAAAGQIJDl^GWFTAGYSGGDIYHSVSHARPRWIWCLLLLA-3000 
AGVGIYLLPNRO-3011 

I 

Stop codon 



( ) - Heterogeneity due possibly 
to 5' or 3* terminal cloning 
artefact. 

FIG. I -3 
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o — o—G — 8 o 0=o=H 55 — S — o — o u — u — S — u 

S=S=|Eg ^=^-§-§ 8=8=8=8 8=8=g=g 
8=8=1=8 8=8=8=8 8=F2~S S=8=8=8 

g=g=2=l S— g— g— o g=g=g=g g— g— p— H 

g_g=g=g g=g=g_g- o-g— g-8 U— U— g=g 
S=S= =3 t:=g=g=g g=g=g=| S=g=g=g 

CJ — CJ — CJ — CJ CJ — U O O S — S tn CD — CD — CD CD 

g=g=g=g 8=8=8=8 I-g^s^l S=S=S=S 
H-t-g-g 8-8-8=8 g=g=g=g g=y=y=^ 

* f v o H rH XJ £ ^ ^ cr> ^ n n i-H 
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O Eh E-t En 

CJ — U — CJ — CJ 
^ Eh — Eh Eh 

cd — cd — cd — o 
cd — u — o — o 

CD CD — CD — CD 

U — U — CJ — CJ 
CD — CD — CD — CD 

Eh — Eh H — Eh 

CJ) — CD — CD (0 

Eh H— Eh Eh 

U — CJ -P o 

cj — cj — u — u 
< — < — < — < 

U — O — O — CD 
CD— CD— CD — CD 
CD — CD — CD — CD 

CD — CD CD CD 

CD CD — CD — CD 

Eh Eh b* — Eh 

CD — CD — CD — CD 
U — U -P 0 
< — < — < — < 
Eh — Eh — Eh — H 

U — CJ U— CJ 

Eh B E-t Eh 

cj — a — cj — cj 
cj — cj — cj — CJ 
cj — cj — o — CJ 

CD — CD — CD — CD 
CD— CD— —CD— CD 

a — u — cj — cj 

Eh — Eh — Eh — Eh 
CJ — CJ — U 4J 

cj — cj- — u — y 

EH — Eh — Eh — Eh 

CJ CJ CJ CJ 

CJ CJ CJ CJ 

CJ — CJ — CJ — u 

<•«— < — < — < 

O 4J U O 

CJ — o — U — CJ 

CD CD — CD CD 

CJ CJ — CJ CJ 

S=g=S=g 

CD — CD — CD — CD 
CD — CD — CD — CD 
CD — CD — CD — CD 

y — o — u— o 

H — Eh — Eh — Eh 

gz:g:zg:=g 

Eh !-. — Eh — Eh 

CJ — CJ — CJ — CJ 

g=g-g=g 

CJ — U — CJ — CJ 
Eh — EH — Eh — Eh 

< < — < — < 

CD — CD — CD — CD 
U — U — CJ — CJ 
En — Eh — Eh — Eh 
< — < — < — < 
CJ — CJ — CJ — CJ 

< — < — < — < 

CJ— CJ— u— u 
Eh — Eh — Eh — Eh 
CD — CD — CD — CD 

a — cj — cj — cj 

in in in m 
n n on 
^ *r 



CJ — CJ — o — o 
Eh — Eh — Eh — Eh 
U CJ CJ CJ 

g=gz:g-g 

Eh — Eh — Eh — Eh 
CJ — CJ — CJ — CJ 

g=g=g=g 

Eh — Eh — Eh — Eh 
CD — CD — CD — CD 
CD — CD — CD — CD 

5=5=5=5 

CJ — CJ — CJ — CJ 

CD— CD CD — CD 

CJ CJ CJ — CJ 

< — < — < — < 
CD — CD — CD — CD 

CJ CJ— CJ — CJ 

< — < — < — < 
CD — CD— CD— CD 

g=g=g=g 
CJ— CJ — CJ— CJ 

g=s=g=g 

CD CD— CD — CD 

U — CJ CJ — U 

CD — CD — CD — CD 
CD — CD — CD — CD 
< — < — < — X 

cj — cj — cj — a 
cj — cj — u — CJ 
cj — u — cj — CJ 

H Eh — Eh — Eh 

CJ — CJ — CJ 
Eh — EH — Eh 



4 



S= 



CJ-.— CJ — CJ 
Eh Eh — Eh 



CJ 

EH 

■Eh EH 



Eh _ 

CJ CJ CJ CJ 

CJ — CJ — CJ — CJ 
< — < — < — nC 
0 Eh — Eh 0 
Eh — Eh — Eh 
Eh — H — Eh 

g=g=g=g 

5=5=5=5 
^i=y-B=8 

CD — CD — CD — CD 
g— CD— g— CD 

EH — EH EH EH 

g=g=g=:g 

Eh — Eh — Eh — Eh 

CJ — CJ — CJ CJ 

■H 0 Eh — EH 
Eh — Eh — EH — |h 
Eh — Eh — Eh — Eh 
O — CJ — CJ — CJ 
Eh — Eh — Eh — Eh 
(0 CD — CD CD 

r»« r- t»«. in 
o o o vo 
in in in oo 



lO 
I 

o 



U- 
Eh- 
Eh- 

5: 

CD- 
U- 
Eh- 
<- 



CM 

in 



-cj — o — CJ 
-Eh — Eh o 
-Eh — Eh O 

:S=5-S 

-CD — CD -P 
-U — U — CJ 

-EH EH — Eh 

— < — < 

cn cn 

p* r*- m 

in in av 
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10 20 30 40 

GAATTCGGACGAC3GCAAGGTTCCAATTGCTCTATC31ATCCC6GCCATAT 

HCVl CXCTCCCAGGCGCCACTGGACGACGCAflGGTTGCAATIXSCTCTATCTATC 

550 560 570 580 590 600 

50 60 70 80 A 90 100 

AACAGGTCACCGCATGGCATGGGAIATGMGATGAACTGGTCCCCTACGACGGCGTTAGT 
::« t ::::: s ::::::::::::::::::: i j : j : t !» j : i j:: j :::::: • 

AACGGGTCACCGCATGGCATGGGATATGAIGATGAACTGGTCCCCTACGACGGCGTTGGT 
610 620 630 640 650 660 

110 120 130 140 150 160 

GG TAGCTCAG CTG CTCCGGATCCCACAAGCCATCTTGGACATG ATCGCTGGTGCTCACTG 

AATGGCTCAGCTGCTCCGGATCCCACAAGCCATCTTGGACATGATCGCTGGTGCTCACTC 
670 680 690 700 710 720 

170 180 190 200 210 220 

GGGAGTCCTGGCGGGCATAG03TATTTCTCCATGGTGGGGAACTGGGCGAAGGTCTTCGC 

^LiLLLLZlLLL* 1 88 1 8 1 s 5 8 5 : 8 s : : : ! 5 : s ! s : : : J : ' 8 * J « * * * * « » « : •> : 

GGGAGTCCTGGCGGGCATAGCGTATTTCTCCATGGTGGGGAACTGGGCGAAGGTCCTGGT 
730 740 750 760 770 780 

k 230 240 250 260 270 280 

AGTGCTGCTGCTATTSGCCGGCGTCGACGCGGAAACCCACGTCACTGGGGGGATCGCCGC 
:::: :i s:::::::::::::::::::::.,.,,........... . :t::: : :::: • 

agtcctcctccxatttgccggcgtcgacgcgGaaacccaogtcaccgggggaagtgccgg 

790 800 810 820 830 840 

29° 300 310 320 330 340 

caaaactacggctagccttactggtctcttcaatttaggtcccaagcagaacatccagct 

: ! ■ : : s it i is : : ::: :: j j • ....... 

CCACACTGTGTCTGGATTTGTTAGCrTCXrrcGCACCM 

850 860 870 880 890 900 

350 360 370 380 390 400 

gatcaacaccaaoggcagtiggcacatcaacaggacggccttgaactgcaatgat^ 

.... . . . . . . s , { ...... . . . 

GATCAACACC^CGGCAGTTGGCACCTCAATAGCACGGCC^ 

910 920 930 940 950 960 

410 420 

CAACACCGGCTGGAATTC 
J :«;:::::: s :X 

CAACACCGGCTGGTTGGCAGGGCTTTTCTATCACCACAAGTTCAACTCTTCAGGCTGTCC 
970 980 990 1000 1010 1020 

FIG.8 
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A 

vs 



AAJ3Q0-43B ( C-tarmtnat ranion of th» niitathip 
ammo -m «f flfrn 

1) JH23 ? 

2) JH27 ? 

3) Japanese isolate (T. Miyamura) ? 

4) EC10 (Italy) 2 don* 

(one nt c 

result in a-m^noed 

5) HCV-1 (chimpanzee) multiple *erence, which did not 

S < ■ ) ns I amino acid change) 

1) d A v 

2) D 
3) 

5)TTQGCNCSIYPGHrrGHRfMV\TOMMMNWSPTTA^ V 

1 ) M - AO «. A « MWfi ^ 

' R ARSTA Va 

~J . T YT N A R TQALT F 

3 ' L Y I M QH R VQ VT TLT 

4 ^ A i A K TASLTA 

5)HWG\OAGIAYFSMVGNWAKVLVVLllPAGWA£TH\^GGSAGHTVSGFVSL 

1) FS R | | TV 

2) FT Dl I R AD 

3) FR S Kl V I R q p 

4) FNL I | R || 

5) LAPGAXWQLINTNGSWHlJ^AIJOiDSLNTGWL 

SUMMARY: NS 1 AA 330-660 

"Isolate" ZHooology (AA330-438) ^Homology (A A 383-405) 

JH23 83 57 

JH27 80 39 

Japanese 73 48 

EC10 (Italy) .84 48 

FIG. 1 0 
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AA #117-308 fntitaW Pm/ a | OPB runtnnl FIG.9 

1) HCT #18 (USA) 3 clones sequenced 

2) JH23 (USA) ? 

3) JH 27 (USA) ? 

4) PBL-Th (USA) 2 clones sequenced 

5) EC1 (Italy) 3 clones sequenced 

6) HCV-1 (chimpanzee) multiple 

C/M+-j->S 

1) (P) 

2) 
3) 
4) 
5) 

©)roa.GKv1DTLT(X3FA^^ 

1) H 
2) 

3) S T T 

4) L 

5) . CF) s 

6) PGC^FSIFUJUJ^CLTVPASAYQVRNSTGLYHV7NDCPNSSIVYEAADAILH 

1) (& V v T 

2) A D V V K T 
3 * s FVA N 

4 >* ART 
5) H V 7 

^TPGCVPCVTUEGNASRCWVAMTPTVA^ 
D 

2) I D 

3) D 
4) 

5) I 

6) ALYViaDLCGSVRVGQLFTFSP 
STOWARY: W S" AA117-308 (93Z) 

HCT#18. PBL-Th, ECl(Italy) have 97X homology with HCV-l 

JH23 and JH 27 have 96Z and 95X horology with HCV-l .respectively 
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